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The development of a mammalian cell line that overexpresses Myc and the purification of significant quantities of c-Myc 
from these cells is described. Three types of c-Myc-driven protein oligomerization (or complex) formations are described: (1) ho- 
mo-oHgomer complexes (herein termed CI complexes) formed by association of at least two peptides of c-Myc, (2) hetero-oligom- 
er complexes (herein termed C2 complexes) formed by heterodimerization of at least two peptides, at least one of which is not the 
c-Myc peptide, and specifically hetero-oligomerization between c-Myc and a 26-29 kd factor, and (3) c-Myc-dependent hetero-ol- 
igomeric complexes (herein termed the C2' complex) formed in the presence of c-Myc, however such hetero-oligomeric proteins 
not containing any peptides which are c-Myc. TTie invention is further directed to a reliable and accurate method for objectively 
classifying compounds, including human pharmaceuticals, as inhibitors of c-Myc activity. 
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TITLE OF THE INVENTION 



C-MYC DNA BINDING PARTNERS, MOTIFS, SCREENING 
ASSAYS, AND USES THEREOF 

Cross-Reference to Related Applications 
This s^lication is a contixiuation-in-part of U.S. Application No, 
07/510,253, filed April 19, 1990, 

Field of the Invention 

This invention is directed to methods for the purification of 
mammalian Myc protein, and methods for the identification of compounds 
that inhibit c-Myc transcriptional activity. 

BACKGROUND OF T WF TNVFNTrON 

Myc is a nuclear oncogene whose abeirant expression is associated 
with many different types of human cancers in many different tissues 
(Cole, M.D., Ann. Rev. Genet. 20:361-384 (1986)). While the mechanism 
of c-Myc oncoprotein action remains unknown, it clearly plays a role in the 
control of cell growth and differentiation (Luscher and Eisenman, Genes & 
Dev. -#:2025-2035 (1990); Penn et al., Sem. Cancer Biol 7:69 (1990)). 
One plausible mechanism of Myc action is as a regulator of transcription in 
a pathway directly controlling proliferation and differentiation. This model 
is consistent with several observations. First, Myc has long been known as 
a nuclear protdn with a general affinity for DNA (Abrams a al. , Cell 
2P:427-439 (1982); Alitalo «r fl/., Namre 306:TJ4^2TJ (1983); Donner 
etal.. Nature 295:262-265 (1982); Pcrsson and Leder, Science 225:718- 
721 (1984)), and recently a site has been identified which is specifically 
bound by bacterially expressed variants of c-Myc (Blackwell et al.. Science 
250:1149-1151 (1990); Prendergast and Ziff, Science 257:186-189 (1991)). 
Second, full length c-Myc has been shown to both activate and repress 
genes in transient transfection assays (Kaddurah-Daouk et al. , Genes & 
Dev. i:347-357 (1987); Yang et al., MoL Cell. Biol ii:2291-2295 

CI iRcrm iTP <:hpft 



(1991)), and will weakly stimulate transcription when fused to a 
heterologous DNA-binding domain (Lech etal.,CeU 52:179-184 (1988); 
Kato «r fl/., MoL CdL Biol. ift5914-5920 (1990)). And finally, sequence 
similarities described bdow place Myc in the company of known 
transcription fectois. 

Myc contains two domains that suggest it oligomerizes, periiaps as a 
dimer, and binds spcdfically to DNA: a leucine zipper domain and a basic- 
hdix-loop-hdix (B-HLH) domain. The leudne zipper is an a-helical 
structure found in sequence spedfic DNA-binding proteins such as Fos and 
Jun where it mediates homo- or heterodimerization via a coiled-coiled 
interaction (Landschuiz ef aL, Sdaice 240:1759-1764 (1988); O'Shea 
et aL, Sdaice 245:538-542 (1989); and reviewed in Busch and Sassone- 
Corsi, TIG 5:36-40 (1990)). This dimerization is necessary for DNA 
binding (Gentz et d., Sdence 245:1695-1699 (1989); Halazonetis et al., 
CeaSSmi-SQA (1988); Kouzarides and 2fif, Nature 336:6^6-651 (1988); 
Turner and Tjian, Sdence 24i: 1689-1694 (1989)). The HLH region also 
appears to mediate oligomerization necessary for DNA binding in several 
developmeatally important proteins (Murre et al.. Cell 58:537-544 (1989); 
Mune et al.. Cell 55:777-783 (1989)). HLH proteins form a large and 
growing family and include the products of the achaete-scute and 
daughterless genes responsible for neural development in Drosophila, the R 
gene femily whidi r^ulates pigment pattern in com, MyoD and several 
other proteins involved in muscle specific differentiation in vertd)rates, and 
a centromere binding protein, CBFl, from yeast ^raun et aL , EMBO J. 
8:701-709 (1989); Cd and Davis, CeU 5J:437-446 (1990); Caudy et al.. 
Cell 55:1061-1061 (1988); Cromniller ef al.. Genes & Dev. 2:1666-1676 
(1988); Davis etal.,Cea 52:987-1000 (1987); Edmondson and Olson, 
Genes & Dev. 5:628-640 (1989); Ludwig and Wessler, Cell 52:849-851 
(1990); Pinney et al.. Cell 55:781-793 (1988); Rhodes and Konieczny, 
Genes & Dev. 5:2050-2061 (1989); Villares and Cabrera, Cell 50:415-424 
(1987); Wright etaL,Cell 55:607-617 (1989)). WhUe many proteins 
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contain either an HLH or leucine zipper motif, Myc is one of a smaller 
number of proteins which contain both an HLH and a leucine zipper. Both 
the leucine zipper containing proteins and the HLH proteins require a 
stretch of basic amino adds adjacent to the dimerization motif to contact 
5 DNA (reviewed in Busch and Sassone-Corsi, TIG 5:3640 (1990); Jones, 
N., Cell 61:9-11 (1990)). Interestingly, all B-ELH proteins appear to bind 
to closely related DNA sequences known as E-Boxes. These are sequence 
motifs found in the immunoglobulin and other tissue specific enhancers 
having a core of NNCANNTGNN [SEQ ID No. 16] where different central 

10 bases are preferred by different B-HLH proteins and the flanking bases can 
affect binding affinity (BlackweU et d.. Science 250:1149-1151 (1990); 
BlackweU and Weintiaub, Science 250: 1 104-1 1 10 (1990)). The core of the 
rqwrted binding site for c-Myc, CACCjTG, fits this pattern and has the 
same core sequence as the upstream sequoice element (USE) of the 

IS Adenovirus major late promoter (Blackwdl et aL , Science 250: 1 149-1 15 1 
(1990); Prendergast and Ziff, Science 2Ji: 186-189 (1991)). A ceUular 
transcription factor (USF or MLTF) which binds to die USE has recaitly . 
been cloned and also contains a B-HLH domain adjacent to a leucine zipper 
(Gregor et al.. Genes <t Dev. 4: 1730-1740 (1990)). 

20 Many of these B-HLH or leucine zipper proteins have been found to 

form not only homodimers but heterodimers with other proteins having like 
dimerization motifs (reviewed in Busch and Sassone-Corsi, TIG 5:36-40 
(1990); Jones, N., Cell 61:9-11 (1990)). Heterodimetization between 
specific groups of B-HLH or leucine zipper proteins can alter their DNA 

25 binding properties. While homodimers might bind weakly, heterodimers 
with the ^propriate paimer can bind with increased affinity and in some 
cases with a new spedficity (Jones, N., Cell 61:9-11 (1990); BlackweU and 
Wdntraub, Science 250:1104-1110 (1990); Wright et al., Mol. Cell. Biol. 
ii:4104-4110 (1991)). Myc is capable of forming a homo-oligomer at high 

30 concentrations in vitro (Dang et al, Namre 557:664-666 (1989); Kerkhoff 
and Bister, Oncogene 6:93-102 (1991)), although it is not clear whether 
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that homo-oligomer actually fonns in vivo (Dang et al., Mol. Cell Biol. 
11:954-961 (1991)). It seems likely that Myc directly interacts with other 
cellular pn>tein(s) to form heteio-oligomer(s), and indeed one such 
'partter* piotdn, designated M^, has recently been identified (Blackwood 
and Hsenmann, Sdaice 252:1211-1217 (1991)). The effect that such 
partner proteins have on Myc DNA-binding specificity is likely to be 
central to understanding the function of Myc. 

Much of the in vitro work done on B-HLH proteins has utilized in 
•vitro transcribed and translated protein or has used protein oveiexpressed in 
bacteria. Myc expressed by these means has been used to detennine 
binding specificity and to demonstrate tiiat Myc can form heterodimers with 
Max (BlackweU ef oL, Sdence 250:1149-1151 (1990); Prendeigast and 
Zifif, Sdence 25i: 186-189 (1991); Blackwood and B s wim ann, Sdence 
251:1211-1217 (1991)). Myc, however, is post-tianslationally modified by 
at least phosphorylation in mammalian cells ^ann and Eisenmann, Mol. 
Cell. Biol. 4:2486-2497 (1984); Ramsay et oL, Proc. Natl. Acad. Sd. USA 
87:7742-7746 (1984)), and post-translational modifications are believed to 
regulate the function of many proteins, including the transcription factors 
Myb, Fos, HSF, CREB, and SP-1 (Abate cf a/., Sdence 1157-1 161 
(1990); Jackson et al,. Cell 55:155-165 (1990); Luschcr et al., Nature 
54^:517-522 (1990); Sorger et al.. Nature 52P:81-84 (1987); Yamamoto 
ef al.. Nature 554:494-498 (1988)). In addition, Myc produced in avian 
cells has been rqxnted to bind more tigfati^y to DNA cellulose than 
bacterially produced Myc^(Kerkhoff and Bister, Oncogene 6:93-102 (1991)). 

Several lines of evidence argue tiiat the biochemical fimction(s) of 
Myc will be determined in large part by hetero-oligomerization witii Max 
and perii^s witfi otiier, as yet unidentified, fiacton. A complete 
understanding of the function of c-Myc will tfierefore require the 
identification of all parmer proteins and a functional characterization of the 
complexes that these proteins form in tiie absence or j)resence of c-Myc. 
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To unravel the nature of Myc's function it will be necessary to determine 
not only the binding properties of all relevant complexes but to ascertain 
how they differ in action once bound. Post-translational modification might 
play a role in modulating ttie formation, binding, or further activities of 
these complexes and the availability of large quantities of modified c-Myc, 
such as described here, should £adlitate a biochemical approach to this 
problem. Such studies should lead us to an understanding of the complexes 
available at different times in different cell types and the consequences for 
each cell in terms of appropriate growth and differentiation, or 
oncogenesis. 

Further, to date, no inhibitors of c-Myc action have been identified. 
The identification of such inhibitors has suffered for lack of identification 
of a specific DNA binding sequence to which c-Myc binds, and for lack of 
a simple, inexpensive and reliable screening assay which could rapidly 
identify potential inhibiton and active derivatives thereof/ Tlius a need also 
still exists for rapid, economical screening assays which identify specific 
inhibitors of c-Myc activity. 

SUMMARY OF THE INVENTION 

Recogniang the potential importance of inhibitors of c-Myc 
oncoprotein activity in the therapeutic treatment of many forms of cancer, 
and cognizant of the lack of a simple assay system in which such inhibitors 
might be identified, the inventors have investigated c-myc DNA binding. 

These efforts led to the development of a mammalian cell line that 
overexpresses Myc and the purification of significant quantities of c-Myc 
from these cells. These efforts culminated in the discovery f three types 
of c-Myc-driven protein oligomcrization (or complex) formations: (1) 
homo-oligomer complexes (herein termed CI complexes) formed by 
association of at least two peptides of c-Myc, (2) hetero-oligomer 
complexes (herein termed C2 complexes) formed by heterodimerization of 
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at least two peptides, at least one of which is not the c-Myc peptide, and 
spedficaUy hetero-oligomerization between c-Myc and a 26-29 kd fector, 
and (3) c-Myc-d^dent hetero-oUgomeric complexes (herein termed the 
C2' complex) formed in the presence of c-Myc, however such hetero^ 

5 oligomeric proteins not containing any pq»tides which are c-Myc. 

Accordingly, the invention is directed to a reliable and accurate 
method for the purification of Myc fitom a mammali a n source. 

Hie invention is further directed to the use of oUgomers containing 
the DNA motif 5*-CACGTG-3', in its double stranded DNA form, as a 

10 reliable and accurate method for the detection of the presence of CI 
conq)lexes in a saiaplQ. 

Hie invention is further directed to the use of the DNA motif 5*- 
CACGTG-3*, in its double stranded DNA form, as a reliable and accurate 
method for the detection of C2 complexes in a sample. 

15 The invention is further directed to the use of tiie DNA motif 5'- 

CAGCTG-3\ in its double stranded DNA form, as a reliable and accurate 
method for tiie detection of C2 ' complexes in a sample. 

Hie invention is further directed to a 26-29 kD protein fraction 
purified from Chinese hamster ovary (CHO) cells or baculovirus, such 

20 protein fraction containing at least one peptide capable of forming C2 
complex oligomers with c-Myc. 

The invention is further directed to a 40-50 IcD protein fraction 
purified from CHO cells, such protein fraction containing at least one 
p^tide capable of forming G2' complex oUgomers in the presence of c- 

25 Myc. 

The invention is further directed to a reliable and accurate method 
-for objectively classifying compounds, including human pharmaceuticals, as 
inhibitors of c-Myc activity, and espedaUy as an inhibitor of CI complex 
formation, C2 complex formation or C2' complex formation. 
30 The invention is further directed to a reliable and accurate method 

for objectively classifying compounds, including human pharmaceuticals, as 
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inhibitors of c-Myc activity, and especially as an inhibitor of CI complex 
DNA binding, C2 complex DNA binding, or C2' complex DNA binding. 

The invention fiirther provides a method for identifying and 
classifying the mechanism of action of a bioactive c-Myc-inhibiting 
5 compound. 

The invention further provides an assay for tiie monitoring of the 
isolation and/or purification of a pq)tide enable of forming a C2 or C2 ' 
complex, or a mixture of such pq)tides from a crude prqwration. 

The invention further provides an assay for the monitoring of the 
10 isolation and/or purification of an c-Myc-inhibiting compound or mixture of 
such compounds from a crude preparation of such compounds. 

pppry mrsrRTPTTON of thf. drawings 

Eg. 1. Purified c-Mvc Protein . A) 1 ng of c-Myc protein purified 
fiom the 5A overexpressing CHO cell line was subjected to 2-dimensional 

15 gd electrophoresis. An isoelectric focusing tube gel was run with pH 5-7 
ampholytes followed by SDS-PAGE and silver staining. The Myc proteins 
are bracketed and arrows distinguish tiie 60, 62, and 72 kD species. The 
gel was trimmed for this figure; tfie actual pi range for tiie Myc proteins 
was 5.0-5.6. B) 0.5 ng of purified c-Myc protein from tiie indicated cell 

20 lines was electiophoresed on an SDS gel and eitho- visualized by silver 
staining Q&ft lane) or electroblotted to nitrocellulose and subjected to 
immunoblotting using tiie ST-2 polyclonal antibody (right 2 lanes). 

Fig. 2. DNA Binding of Purified c-Myc Proteins . TheEMSAwas 
earned out as described in matcxials and mediods using equal amounts 

25 (approximately 2 ng) of the following probes and 0.5 /tg of dtiier purified 
CHO produced c-Myc or baculovirus produced c-Myc: OiE2)3 lanes 1 and 
7, OtE3)3 lanes 2 and 8, MLC-A lanes 3 and 9, MLC-B lanes 4 and 10, 
(USE)3 lanes 5, 11, and 12, and HSE lanes 6 and 13. Full probe 
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sequences are given in materials and methods. Lanes 1-6 and lanes 7-13 
are diifeient exposures of lanes ficom the same gel. 

Hg. 3. CA Binding Activity is Presp nt in Mvc containing Slices of 
SDS Gels. 400 /ig of CHO produced c-Myc or 163 fig of baculovirus 
5 produced c-Myc was sqarated on an SDS-PA(ffi geL Proteins from 0.5 
cm slices were xecovered, xeoatured as described in malaials and methods, 
and analyzed by EMSA the (USE)3 probe. 0.4 ng of the CHO Myc 
load and 5 nl of the protein from the CHO Myc-containing slice were 
analyzed on the same gel (left panel). 0.37 ng of the baculovirus Myc load 

10 and 5 /d of the protein from the baculovirus Myc slice were analyzed on 
the same gd (right panel). Slices from other molecular weight ranges of 
the same gel showed no binding (data not shown). 

Fig. 4. Activity is Fomed bv c-My c and a 26-29 kP Factor. 
Protons from gd slices were recovered and analyzed by EMSA as 

15 described in materials and mediods using the (US£)3 probe. Lanes 1-4 
represent proteins from the same gd loaded with baculovirus produced 
Myc described for Fig. 5. These lanes contain 0.37 fig of the loaded 
material (lane 1), 0.75 ftg BSA with 7.5 fd of proteins from dther a Myc 
slice Oane 2) or a 26-29 kD slice (lane 3), or 7.5 of each slice used for 

20 lanes 1 and 2 plus 0.2 fig of BSA (lane 4). Lanes 5-8 and 10 contain 

proteins from gels loaded with Myc purified from CHO cells. These lanes 
contain 0.47 of the gel load (lane 5), 4 fd of material from a Myc slice of a 
, gd loaded with 400 fig of Myc (lane 6), 7 /d of material from a 26-29 kD 
slice of a similar gd plus 0.8 ftg Protein A (lane 7), and both 4 fil of the 

25 Myc slice and 7 fd of the 26-29 kD slice Oane 8). Lanes 9-12 utilize the 
bacterially aqiressed Protdn A-Myc fiision proteins containing dther the 
Myc B-HLH and leudne zipper domains (amino adds 353-439) or lacking 
the basic region and containing Myc amino adds 372-439. These were 
expressed and purified as described in materials and methods. Lane 9 

30 contains 0.5 fig of Protein A-Myc(353-439) and lane 10 contains the same 
plus 7 fil of the 26-29 kD slice. Lane 1 1 contains 1 fig of Protein A- 
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Myc(372-439) and lane 12 contains 0.5 /xg of Protein A-Myc(372-439) plus 
7/il of the 26-29 kDsUce. 

Fig. 5. CI' Binding Ac tivity Requires a 40-50 Kd Factor. A) 101 
tig of CHO produced c-Myc was sq>arated on an SDS gel. Proteins were 
recovmd, resuspended in 100 fd, and renatured and analyzed by EMS A 
using the ERP3/4 probe. This pn^ contains the portion of the MLC 
enhancer that encompasses the /x£2 site. EMSA samples contained 0.3 /ig 
of the SDS gel load (lane 1), 7.5 /d of the proteins from the Myc slice 
(lane 2), or the 40-50 kD slice (lane 3), or 7.5 ^1 of both slices renatured 
together (lane 4). B) EMSA samples contained 0.9 iig purified baculovirus 
produced c-Myc (lane 5), 3 ii\ of proton from the 40-50 kD slice of a gel 
loaded with 400 /ig CHO produced c-Myc (lane 6), or both renatured 
together (lane 7^. The probe was ERPl/2. C) EMSA samples contained 
10 til (0.9 /ig) of bacterially produced c-Myc fusion protein containing Myc 
amino adds 353-439 (lane 8), 0.47 /ig of CHO produced c-Myc Oane 9), 5 
fi\ of protein from the 40-50 kD slice of a gel loaded with 400 /xg of the 
CHO Myc shown in lane 9 (lane 10), or 5 /xl of the same 40-50 kD 
material renatured in the presence of either 0.9 of the baculovirus 
produced Myc shown in lane 5 (lane 11), 2 ftl (0.18 /xg) of the bacterially 
produced Myc fusion protein containing Myc amino acids 353-439 (lane 
12), or 4 /xl (0.36 ftg) of the same bacterially produced Myc fusion protein 
(lane 13). The probe was ERPl/2. 

Fig. 6. Antibodies to c-Mvc Interact with the CI and C2 
Complexes , EMSA reactions were set up with the indicated Myc protein 
preparations (0.37 /xg baculovirus produced c-Myc or 0,47 /xg of CHO 
produced c-Myc). These reactions were preincubated 30 min on ice in the 
presence of the indicated antibody (a-Myc monoclonal 1F7 or a 
monoclonal directed against the lambda repressor, cl). 1 ng of SMS probe 
or /xE2-containing probe number 7 (see Fig. 7) was added subsequently and 
binding and electrophoresis were as usual. 
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Fig. 7. Oligonucleotides Selected from Rando m flffgiience after 8 
Rounds of EMSA . Sequences were selected fiom oligonucleotides 
containing 20 base pairs of random sequence using a reiterative EMSA 
procedure described in niateiials,and methods. Underiined nucleotides are 
ftom the PGR primer sites. Tables below the aligned sequences tabulate 
the frequency of each base in the 6 ilanldng positions surrounding the 
CACGTG motifs. Only bases next to a perfect fit of the CACGTG core 
were tabulated since sequences without this core were found not to function 
as high affinity binding sites (Fig. 8, and data not shown). Bold numbers 
adjacent to individual sequences indicate those oligonucleotides which were 
tested individually by EMSA in Fig. 8. Asterisks indicate additional 
sequences which were tested individually (data not shown). 

Fig. 8. .Sriected Sites form Predicted Complexes. EMSA was 
carried out using either 2.8 ng of the SMS probe or equal amounts (1 ng) 
of probes 1-11 indicated in Fig. 7. Probes l-ll were labeled and gel 
isolated in parallel and had approximately equal specific activities. Binding 
reactions contained either no additional protein (-), 0.37 ng of baculovirus 
produced c-Myc (B) or 0.47 /tg of CHO produced c-Myc (C). Free probe 
is visible at the bottom of the gel- 
Fig. 9. Off-Rate of the CI and C2 Complexes . The standard 
EMSA reaction was scaled up for 11 samples containing 0.4 /ig of purified 
baculovirus produced c-Myc per sample. Probe and competitor were 
(USE)3. After allowing 20 min for binding 20 ftl was loaded on a prerun 
EMSA gel as a measure of the starting amount of complex (ST) and 
enough cold competitor was added to the remaining sample to achieve a 
250 fold molar excess over probe. Immediately upon addition of 
competitor the sample was gently mixed and 20 ftl aliquots were loaded at 
the indicated times (0, 30 s, 1 min, 4 min, etc.). A control sample (C) 
was made up individually in which competitor was added prior to the start 
of binding to demonstrate complete competition. This sample was loaded 
at the same time as the ST sample. All samples were loaded on a 
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continuously running gel so that the complex in the starting lane runs ahead 
of the equivalent complex in lanes loaded later. 

DESCRIPTION OF THE PREFER l^T^ lilVfRQ DIMENTS 

In Hit description that follows, a number of terms used in 
5 recombinant DNA technology art extenavely utilized. In order to provide 
a clearer and consistent understanding of the specification and claims, 
including the scope to be given such temis, the following definitions are 
provided. 

Oligomer of Interest . As used herein, an "oligomer of interest" 

10 refers to any of the following types of oligomeric proteins: first, Myc- 
oontaining oligomers including homo-oligomers of Myc pq>tides (a CI 
complex), and hetero-oligomers containing at least one peptide of Myc and 
one pq)tide of a Myc "partner* (a CZ complex); second, oligomers that 
form in the presence of Myc-containing homo-oligomers or Myc-containing 

15 heterooligomers but which themselves do not contain the Myc peptide, 
such oligomers including non-Myc-containing homo-oligomers that 
associate in the presence of Myc and non-Myc-containing hetero-oligomers 
that associate in the presence of Myc (a C2 ' complex). 

Oligomer . An "oligomer" as it refers to proteins, means a protein 

20 composed of more than one pq)tide subunit, such as dimers, trimers, 

tetramers, etc. Such oligomeric protein may be a homo-oligomer, that is, 
composed entirely of two or more identical subunits; alternatively, such 
oligomeric protein may be a hetero-oligomer, that is, composed of at least 
two different pq)tides. Oligomers containing three or more pq)tides may 

75 contain more than one copy of a pq>tide. 

C2" Proteinfs) . As used herein, for convenience, a "C2' protein" is 
a protein or peptide that is a member of the second class of the "oligomers- 
of-interest," that is, a protein that forms oligomers in the presence of Myc, 
c-Myc homo-oli^'omers or Myc-containinp hetero-olifiomers so as to bind to 

CI IBCTm ITC QMPPT 
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a specific DNA sequence, but which does not contain a Myc peptide, such 
oligoraeis including non-Myc-containing horao-oligoraers that associate in 
the presence of Myc and non-Myc-containing hetenM)ligomers that 
associate in ihc presence of Myp. 

5 OpeTablv-linked. As used herein, two macromoiecular elements arc 

operably-linkBd when the two macromoiecular elements are physically 
arranged such that fectois which infliimce the activity of the first element 
cause the first element to induce an effect on the second dement For 
•example, the transcription of a coding sequence which is operably-linked to 

10 a promoter element is induced by fectors which "activate" the promoter's 
activity; transcription of a coding sequence which is qpetably-linked to a 
promoter dement is inhibited by factors which "rqpress" the promoter's 
activity. Thus, a promoter n^on would be operably-linked to the coding 
sequence of a protein if transcription of the coding sequence activity was 

15 influenced by the activity of the promoter. 

Response. As used herein, the term "response" is intended to refer 
to a change in any parameter which can be used to measure, indicate or 
otherwise describe c-Myc action or oligomer (homo-oligomer (CI complex) 
or heterooUgomer (C2 complex)) formation, including c-Myc dependent - 

20 hetero-oligomerization (C2 ' complex) formation. The response may be 

revealed as a physical change (such as a change in phenotype) or, it may be 
revealed as a molecular change (such as a change in a reaction rate or 
affinity constant). Detection of the response may be performed by any 
means appropriate. "Detecting" refers to any method by which such 

25 response may be evaluated so as to i»rovide a meaningful indicia of whether 
the event has occurred. 

rnmTwund . The term "compound" is intended to refer to a 
chemical entity, whether in the soUd, Uquid, or gaseous phase. The term 
should be read to include synthetic compounds, natural products and 

30 macromoiecular entities such as polypeptides, polynucleotides, or Upids, 



•13- 



and also small entities such as neurotransmitters, ligands, hormones or 
elemental compounds. 

Bioactive Compound . The term "bioactive compound" is intended 
to refer to any compound which induces a detectable or measurable 
response in the methods of the invention. 

Promoter , A •promoter" is a DNA sequence located proximal 
to the start of tzansaiption at the 5' end of the transcdbed sequence. The 
promoter may contain multiple regulatory elements which interact in 
modulating transcription of the operably-linked gene. 

Expression , Expression is the process by which the information 
encoded within a gene is revealed. If the gene encodes a protein, 
expression involves transcription of the DNA into mRNA, the processing 
of mRNA Qf necessary) into a mature mRNA product, and translation of 
the mature mRNA into protein. 

A nuddc add molecule, such as a DNA or gene is said to be 
'capable of expressing" a polypq)tide if the DNA contains the coding 
sequences for the polypeptide and expression control sequences which, in 
the appropriate host environment, provide the ability to transcribe, process 
and translate the genetic information contained in the DNA into a protein 
product, and if such expression control sequences arc operably-linked to the 
nucleotide sequence which encodes the polypeptide. 

Cloning vehicle . A "cloning vehicle" is any molecular entity that is 
capable of delivering a nucleic add sequence into a host cell for cloning 
purposes. Examples of doning vehicles indude plasmids or phage 
genomes. A plasmid that can replicate autonomously in the host odl is 
especially desired. Altemativdy, a nuddc add molecule that can insert 
into the host cell's chromosomal DNA is especially useful. 

Cloning vehicles arc often characterized by one or a small number 
of endonuclease recognition sites at which such DNA sequences may be cut 
in a determinable fashion without loss of an essential biological function of 
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the vehicle, and into which DNA may be spliced in order to bring about its 
iq)licaticn and cloning. 

ITie cloning vehicle may further contain a marker suitable for use in 
the identification of ceUs tiansfonncd with the cloning vehicle. Markers, 
for example, are tetracycline resistance or ampicillin resistance. The word 
■vector" is sometimes used for "cloning vehide." 

FT presaon vehicle . An "expresaon vehicle" is a vehide or vector 
similar to a cloning vehicle but is espedally designed to provide sequences 
capable of expressing the cloned gene after transformation into a host. 

In an expression vehide, the gene to be doned is usuaUy operably- 
linked to certain control sequences such as promoter sequences. 
Expression control sequences will vary dq»ending on whether the vector is 
designed to express the operably-linked gene in a prokaryotic or eukaryotic 
host and may additionally contain transcriptional dements such as enhancer 
dements, termination sequences," tissue-specifidty dements, and/or 
translational initiation and termination ates. 

Host . By "host" is meant any organism that is the redpient of a 
cloning or ecpression vehicle. 

a, y<a>1arinn of c-^Tyc Protein Fmm Mammalian Cells and 

pprparation o f Prartions Containinp C2 and C?' C9nip)ex 
pjnHinp Activity 

Although there have been previous reports of purified Myc protein, 
the present inventors found that the Mvc protein prqaiations described 
therein, and the methods used to isolate that protdn, feUed to achieve the 
requisite amount of ^dd needed to sequence characterize Myc action in 
mammalian sources. Tlie inventors have overcome this problem and 
describe, for the first time, a unique and useful method for the isolation of 
highly purified mammalian c-Myc protein .which provides the requisite high 
degree of quantity of mammalian c-Myc protein needed for the 
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characterization of c-Myc directed DNa binding and biological action. The 
inventors have also been able to purify large quantities of Myc from a 
recombinant insect cell system. The purified Myc protein of the invention 
exhibits the only known biochemical activity of c-Myc, an ability to bind 
the sequence CACGTG. As a direct result of the method of the invention 
for the isolation of c-Myc protdn, the inventors were able to identify 
pq)tides that naturally with c-Myc in a heterooligomers, or 

peptides that naturally associate with each other as a result of the action of 
c-Myc, such peptides found to be present in certain colunm 
chromatography fractions of the c-Myc purification scheme. 

Accordingly to the invention, purification of Myc from a 
mammalian source is preferably achieved utilizing a mammalian cell line 
that ovexexpresses either recombinant or non-recombinant c-Myc and is 
performed completely on ice or equivalent temperatures of 0-5^C, using 
reagents and buffers at the same temperature. For example, the 
overexpressing Chinese hamster ovary (CHO) cell line 5 A is useful for 
such purification. In CHO 5 A cells, recombinant mouse c-Myc is under 
the control of a regulatable promoter, and has been integrated and 
amplified in the genome of the parent CHO cell line for maximum stability 
and production. Except where otherwise noted, for the methods and assays 
of the invention, the native or recombinant Myc should include at least the 
two coding exons of Myc. 

After collecting the cells by ccntrifugation using techniques known 
in the art, and prior to lysis of the outer cell membrane, the cells should be 
washed at least once in a low salt neutral buffer such as 0.9% NaCl in 10- 
50 mM phosphate, pH 7.0-7.5 (phosphate buffered saline, PBS) to remove 
remaining growth medium. 

Lysis of the washed cells is also achieved in a low salt, neutral to 
mildly acidic lysis buffer, preferably about pH 6.8, containing at least one 
protease inhibitor, such as aprotinin or phenylmethylsulfonyl fluoride 
(PMSF), preferably containing a combination of such inhibitors. Salts such 
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as potassium On the KQ form) and magnesium On the MgCl2 form) are 
also preferably added. In addition, nonionic detergents such as NP40 (0.5% 
v/v) and Na-deoxycholate (0.1 %) should be added. 

Cell outer membrane lysis should be performed under conditions 
that lyse the host cdl without lysing the nucleus, or induce significant 
leakage ftom the nuclear membrane. Hie cdls may be aUowed to sit for a 
short period of time, for example, 10 minutes, in the detergent-containing 
lysis buffer before mechanical intervention is utilized in the lysis step. 
Mechanical intervention is best performed with a gentie disruption of the 
detergent treated cells, for example, utilizing 40 strokes in a Dounce 
homogenizer witii a type A pestte, or the equivalent of such treatment. 

Nuclei may be collected from the. lysed cdl prqaration using 
techniques known in the art, such as, for example, ceatiifugation at lOOOxg 
for 5 min at 4"C and washed at least once in the same low salt lysis buffer 
used to lyse the outer cell membrane. 

Nudd are then resuspended in the low salt lysis buffer that 
additionally contains suffident DNAse I and incubated for a time sufficient 
to efficadously degrade the DNA in such nucld to a size and viscosity that 
allows subsequent purification of the c-Myc from this preparation as 
described below. 

Following the DNAse I treatment, die sample is diluted with a high 
salt neutral buffer tiiat brings the salt (as NaQ) concentration of the sample 
to at least 2 M. Such high salt buffer preferably additionally also contains 
amounts MgCl2 suffident to maintain the same concentration of this, salt in 
the final diluted prq»aration, and also additional detergent NP40so as to 
retain efficadous levels after sample dilution. 

Ill mammalian host cdls, CrMyc is generally tightly associated with 
the nuclei. Accordingly, it is necessary to solubilize c-Myc in a manner 
tiiat does not destroy its biological activity dr its ability to renature into a 
biologically active form. The residual nuclear material is first removed by 
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ccntrifugation and then the pellet resuspended for solubilization of the c- 
Myc. Solubilization of the c-Myc protein in a manner that destroys this 
association may be achieved with either sodium dodecyl sulfate (SDS) or 
uita at concentrations greater than 4 M. Preferably, SM urea is utilized. 
5 Residual non-lysed nucld may also be solubilized at this time by vigorous 
stirring for about 30 min. The solution is then centrifiiged to pellet any 
remaining insoluble material prior to the subsequent chromatography stq>s, 
for example, at SOOOxg for about 10 min. 

The supernatant fraction recovered from the centrifiigation stq) is 

10 applied to a DEAE Sq)harose CL-6B column equilibrated in the urea- 
containing buffer as described above, and the column thoroughly washed 
with such buffer to remove unbound protdn. A second wash was 
performed with the addition of an intermediate amount of NaCl, 0. IM 
NaCl to the buffer. Finally, Myc protein was eluted by raising the salt 

15 concentration in the buffer to 0.35M. 

All protein duting with the 0.35M salt wash were collected and 
applied to a FPLC Mono-Q colunm. The column was washed and with a 
gradient of 0.10 M NaCl to 0.35 M NaCl, followed by a 2 M NaCl step 
wash. Holding the gradient at intermediate salt concentrations, for example 

20 at about 0.19 M NaCl, until the end tail of the contaminating protein is 
finished eluting will enhance the purity of the subsequently eluted Myc 
protein. 

Myc may be identified in the column duent by any technique that 
specifically recognizes Myc protdn or its activity. For example, a 

25 monoclonal antibody such as IF? may he used in an imihunoassay for the 
presence of Myc protein. Alternatively, DNA binding activity to an 
oligonucleotide containing the sequence 5'-CACGTG-3' may be used to 
monitor the purification. Monoclonal antibody 1F7 is directed against the 
peptide sequence of amino adds 305-317 in murine c-Myc. Other Myc 

30 monoclonal antibodies useful in such assays are commercially available. 
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Pools of fractions from this column contained the C2 and C2' 
binding activities described below, and the presence of peptides capable of 
entering into C2 and C2' hetero-oligomers, and cspedaUy C2 and C2 ' 
heteKH)Iigomers, may be assayed by the ability of such heteKK)ligomers to 
5 bind to the DNA sequences 5'-CACGTG-3' and 5'-CAGCTG-3', 

respectively. Myc purified from the CHO cells appeared as multiple bands 
by immunobloL 

b. Purification of c-Mvc a^ H Its Partners From a 
Raciilovinis Source 

10 Human c-Myc may also been purified using the baculovirus 

overexpression system. For purification, S0 cells that had been infected 
with recombinant baculovirus canying the c-Myc gene, using techniques 
known in the art were harvested just prior to the onset of lysis (-48 hours 
post infection). Solubilization and purification of the recombinant c-Myc 

15 were carried out as with the CHO produced Myc resulting in a yield of 2.5 
mg/8xl0' cdls. Myc purified from these insect cells was apparently 
homogeneous by silver staining, and ran on electrophoresis as a single 
diffuse band of - 60kD. This was in contrast to the multiple bands 
observed with mammalian Myc by imraunoblot (Fig. IB). 

20 c. netecrion of S «iiience Soedfic DNA Binding Activity 

The above prqarations contain two sequence spedfic DNA-binding 
activities that both contain Myc protdn. The first activity contains only 
Myc (i.t., forms the Myc homo-oUgomer) and binds very weakly to 
sequences with the core CACGTG. The binding is assayed by determining 
25 the off rate and by competitor assays, both techniques known in the art. 

The binding of c-Myc homo-oligomers is characterized by an immeasurably 
fast off rate and by the observation that it i> almost impossible to add 
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enough cold competitor sequence to completely compete away this complex 
in electrophoretic mobility shift assays (EMSA). This latter observation 
implies that it may not be possible to raise oligonucleotide concentrations 
above the Kq, thus preventing the detennination of exactly what fraction of 
the final Myc preparations are active for sequence specific binding by the 
Myc homo-oligomers. 

A binding site selection procedure may be used to detexniine the 
optimal binding site for Myc. Sites may be selected from a pool of random 
oligomers, such as 20-mers, in order to decrease bias in detennining an 
optimal binding site. A 12 base consensus sequence of GACCACGTGCTC 
[SEQ ID No. 1] may be used, with the central E box core of CACGTG 
zppezxing to be most conserved. Halazonetis and Kandil (Halazonetis and 
KandU, Proc. Nml Acad. Sci. USA 88:6162-6166 (1991)) assumed that the 
flanking sequences might be symmetric, and rqx>rted an optimal sequence 
of GACCACGTGGTC [SEQ ID No. 2]. This sequence is quite similar to 
the consensus that is preferred here, differing in only the 10th position 
(where predominandy a C was utilized in the invention, although G is 
significantiy rq)resented Fig. 7, Group I). Accordingly to the invention, it 
is possible to select a 12 base consensus sequence from a pool of predicted 
complexity of 4^^ 10^^) thus indicating that Myc has a strong sequence 
preference despite its apparent weak binding affinity. 

The second Myc containing DNA-binding complex provided in the 
prq>arations of the invention also binds to sequences with a core of 
CACCjTG, but binds dgnificandy more tighdy than Myc alone. This 
complex (the C2 complex) requires a 26-29 Id) factor in addition to Myc. 
This additional factor copurified with Myc, presumably because of similar* 
chromatographic properties and not via association with Myc since the 
chromatography performed in 5M urea would denature such association. 
This additional factor resembles Max, a protein whose gene was recently 
isolated from mammalian cells, in that it does not bind efficiently to DNA 
by itself but can heiero-oligomerize with Myc to bind tighdy to the 
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sequence CACGTG. However, that the fector of the invention differs from 
Max in its apparent size (Max is reported to migrate at 21 kD). 
Additionally, the Myc/Max hetero-oligomer appears to migrate at least as 
slowly as a Myc only complex in EMSAs, while the d complex of the 

5 invention migrates more rapidly than Myc alone. 

In addition to the 26-29 KD fector, a second cqpuri^g factor of 
40-50 kD has been identified. Ihe sites selected by complexes containing 
tills fector (herein termed C2' complexes) contained a CAGCTG core (the 
/xE2 sequence motif) as weU as flanking sequences which bear a striking 

10 resemblance to a lecenUyrqwrted binding site for myogeninhomo- 
oligomers (Wright et al., Mol CeU. Biol. JJ:4104-4110 (1991)). 
Myogenin is an HLH containing protein of predicted molecular weight 32.5 
ld> whose optimal binding site is AACAGT/CTGTT [SEQ ID No. 3]. 
None of the sites (0/36) selected by the C2 or C2' complexes of tiie 

15 invention centred a CAGTTG motif wfafle roughly half of tiie myogenin 
sites contained such core sequences. 

d. j Atsav for a rnmpound th at Inhibits Mvc Action 

For tiie ease in describing tiiese assays, CI complex association 
and/or DNA binding, C2 complex association and/or DNA binding, and 
20 C2 ' complex assodation and/or DNA binding arc all referred to as c-Myc 
activity. 

Assays for c-Myc activity may be performed in vitro or in vivo. In 
vim> assays may be performed as described in tiie Examples, for example, 
by evaluating tfie effects tiie deared compound or various amounts of such 
25 compound on tfie results of tiie electrophoretic mobiHty shift assay and site 
selection techniques tiiat will reveal whetiier binding of tiie oUgomer of 
interest to a specific DNA sequence motif has occurred in tiie presence of 
the compound. 
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For the in vivo assay of a compound that inhibits the desired Myc 
activity at least two genetic constructs are utilized. First is required a 
recombinant construct capable of expressing Myc is required; second is 
required a rqwrter gene whose expression is opeiably linked to the Myc 
activity and especially to the binding of the desired oligomer to the specific 
DNA sequence or motif. 

If desired, a recombinant constnia capable of expressing a C2 
complex protein or C2' complex protein may also be used. Alternatively, a 
host may be chosen may be chosen that naturally expresses such protein. 

Recombinant constructs that are capable of expressing Myc protein 
may be constructed utilizing the guidelines as described below or purchased 
commercially. 

The desired DNA binding sequence may be operably linked to any 
gene which confers a selectable marker in the host system. In a preferred 
embodiment, a marker gene which allows phenotypic selection in yeast, 
and especially in Saccharomyces cerevisiae is used. 

Yeast that have been co-transformed with both an expressible myc 
gene and with the desired DNA binding sequence may be used to (1) 
identify the presence or absence of endogenous host proteins that interact 
with Myc in a C2 or C2 'complex (2) classify a protein as a CI complex 
protein or as a C2 ' complex protein; and (3) identify and classify 
compounds as agents which disrupt such Myc activity. C2 complex proteins 
have previously also been termed Myc "partner" proteins. 

All three applications are based on the same principle: in the 
presence of c-Myc biological activity, one of three things will happen: CI 
complexes wiU form; C2 complexes will form; or, C2' complexes will 
form. The protein complexes so formed, and especially the oligomeric 
complexes, will bind to a specific DNA motif, binding to such motif will 
be operably linked to the marker gene, and expression of the marker gene 
will be altered, preferably stimulated, in response to such DNA binding. 
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In the absence of such oligomerization, oligomer-DNA complex formation 
wiU not occur and expression of the marker protein wiU not be altered. 

In the assays of the invention, there may be some level of binding to 
a desired DNA binding sequence even in the absence of c-Myc. However, 

5 when c-Myc is present in the cell, the amount and strength of the specific 
DNA binding is increased. 

Hosts that have been co-transformed with both an expressible c-Myc 
gene and with the desired DNA binding sequence may be used to assay for 
the presence or absence of endogenous host proteins that interact with c- 

10 Myc activity. If such analyses reveal that the host contains c-Myc binding 
proteins, or c-Myc dependent oligomers which, in the presence of c-Myc 
specificaUy bind to a deaied DNA sequence, such c-Myc partner protein or 
d^dent-oUgomer protein may be isolated using techniques known in the 
art such as gd mobility shift analysis, cDNA expression cloning vectors 

15 such as, for example, XgtlO and Xgtll, or other doning systems 

specifically designed for high-effidency cloning and expression of full- 
length cDNA in yeast such as, for example, pGl and pTRP56, aU of which 
are corameicially available (Qontech, Palo Alto, California). 

It is not necessary that the host be completely defident in C2 

20 complex proteins (c-Myc partner proteins) or C2 ' complex proteins to be 
useful in the method of the invention. As described bdow, if c-Myc is 
expressed at levels much greater than those found in the host, rqwrter gene 
transciq>tion from endogenous partner proteins may be negUgible, or of 
such low amount that it does not otherwise alter the utiKty of the methods 

25 of the invention. 

If the c-Myc expression is transcribed with a strong promoter, 
and/or if the c-Myc expression cassette is suppUed on a high copy number 
vector, the levels of c-Myc wiU be high enough to overcome a low level 
background and such c-Myc constnicts may be used to analyze the abiUty 

30 of cloned c-Myc partners to influence c-Myc DNA binding. One of 
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oniinaiy skill in the art can adapt the expression system to the level of 
expression desired using methods known in the art. 

The C2 complex protein (the partner protein), or the C2 ' complex 
protein, if supplied as a recombinant construct to the host cell, should be 
capable of expressing at levels comparable to that of the c-Myc protein. 
C2 complex proteins may be identified by utilizing a phage plaque assay, 
as described in the commonly-owned, copending U.S. patent application 
entitled "Protein Partner Screening Assays and Uses Thereof," Application 
No. 510,254, filed April 19, 1990, and incorporated herein by reference. 
Proteins identified by such screening assay can be subcloned into 
eukaryotic expression vectors known in the art and commercially available 
so as to provide a recombinant source of partner protein gene expression. 

The genetic constructs of the invention may be placed on different 
plasmids, or combined on one plasmid. A construct may also be inserted 
into the genome of a host celL Preferably, the constxua coding for the c- 
Myc protdn and the construct coding for the C2 complex protein or the 
C2 ' complex protein are provided to the host on two different plasmids. 

It is important to establish that the effect of the compound is due to 
an effect on c-Myc activity and not an effect on the activity of tiie reporter 
product per se. Such effect can be established by comparing the results 
found in hosts which lack either tiie c-Myc expression vector or the C2 or 
C2 ' protein expression vector or botii. 

The desired DNA binding motif may be located at any site in the 
transcription cassette of the rqx)rter gene which allows for the transcription 
of that gene to be operably-linked to binding of the desired oligomer. Thus, 
such motif may be located 5' to the transcriptional start site or 3' to the 
transcriptional start site, for example, in an intron, similar to its location 
relative to the promoter region in the immunoglobulin genes. 

The reporter gene whose expression is operably linked to c-Myc 
activity and especially to oligomer DNA binding may be any gene whose 
expression can be monitored. Any detectable phenotype change may serve 
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as the basis for the methods of the invention. In a preferred embodiment, 
the rqwrter gene is a gene not normally expressed by the host, or a gene 
that rq)laces the host's endogenous gene. Any rqwrter gene which is 
capable of being operably-linfced to a promoter capable of responding to the 
5 binding of the oligomer of interest to the specific target DNA sequence 
may be used. 

Especially, for example, genes that endow the host with an ability to 
grow on a selective medium are usefiiL For example, in yeast, use of the 
yeast LEU2 gene as a rqwrter gene in strains that normally lack LEU2 

10 allows such yeast to grow on leucine as a sole carbon source. Expression 
the rqwrter gene is monitored by merely observing whether the host 
possesses the ability to grow on leucine. In a similar manner, use of the 
sudl gene as a rqwrter gene would allow growth of the a suc2' yeast host 
on sucrose to be used as the detection method. In both examples, growth 

15 on the indicated substrate could be used to indicate spediic DNA binding 
of the oligomer of interest and lack of such growth could be used to 
indicate lack of binding or lack of oligomer formation. 

In another example, a construct (and host) which is goi\'^gol\^' 
would respond to galactose in the medium; a construct (and host) which is 

20 lacl'^gall'^ would be lactose sensitive. Other rqwiter genes include Aii3, 
ura3andflp5. One of ordinary skill in the art can imagine many other 
appropriate r^rter systems which would reveal the presence or inhibition 
of DNa binding or biological activity of the oligomer of interest. 

Rq)orter constructs in which the deared DNA sequence motif and 

25 the tocZ rqwrter gene are operably linked' will express P-galactosidase in 
response to binding of a c-Myc activity induced oligomer binding to such 
DNA sequence. Such expression can be eaaly scored by monitoring the 
ability of tiie host to produce /S-galactosidase (Maniatis, T.etal., 
Molecular Cloning (A Laboratory Manual), 2nd edition, Cold Spring 

30 Harbor Laboratory, 1989). The production of /S-galactosidase may be 

visually monitored by detecting its activity to reduce the chromophoric dye. 
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X-gal (commercially available from International Biotechnologies, Inc., 
New Haven, CI). iS-galactosidase reduces X-gal to a fonn which 
possesses a blue color. In another embodiment, the coding sequence of 
chloramphenicol acetyltransferase (CAT) is used as the rqx)rter gene. 

Any detection method that can identify expression of the reporter 
gene may be used. For example, levels of the product of the zepoTtcr gene 
may be directly assayed with an immunoassay. Such immunoassays 
include those wherein the antibody is in a liquid phase or bound to a solid 
phase carrier. In addition, the reporter gene can be detectably labeled in 
various ways for use in immunoassays. The preferred immunoassays for 
detecting a rqx)rter protein using the include radioimmunoassays, enzyme- 
linked immunosorbent assays (ELISA), or other assays known in the art, 
such as immunofluorescent assays, chemiluminescent assays, or 
bioluminescent assays. 

In an assay to screen for the ability of a compound to alter binding 
of the oligomer of interest, yeast strains that express such the desired 
peptide or peptides and which contain the related DNA binding sequence 
motif, may be plated and grown as lawns and the compound to be tested 
may be applied to the plates on a filter paper disk that is impregnated with 
such compound. Alternatively, the compound may be incorporated into the 
media within which the host cells are growing. 

One nuy be able to detect the ability of a compound to alter c*Myc 
activity by the appearance of a zone, which oftsn resembles a halo, around 
the compound-impregnated disk. If for example, the compound is toxic to 
the host's survival per se, the host will not grow in the zone containing the 
compound. 

The methods of the invention can be used to screen compounds in 
their pure form, at a variety of concentrations, and also in their impure 
form. The methods of the invention can also be used to identify the 
presence of such inhibitors in crude extracts, and to follow the purification 
of ihe inhibiiois iherefrom. The methods of the iaveation are also useful in 
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the evaluation of the stability of the inhibitors identified as above, to 
evaluate the efficacy of various prqjarations. 

The permeability of cells to various compounds can be enhanced, if 
necessary, by use of a mutant cell strain which possess an enhanced 
permeability or by using compounds which are known to increase 
permeability. For example, in yeast compounds such as polymyxin B 
non^)q>tide may be used to increase the yeast's permeability to small 
organic compounds. In cells ftom the higher eukaryotes, dimethyl sulfoxide 
Q3MS0) may be used to increase permeability. Analogs of such 
compounds which are more permeable across yeast membranes may also be 
used. For example, dibutyiyl derivatives often display an enhanced 
permeability. 

in a pr^erred onbodiment, the genetic constructs and the methods 
for using them are utilized in eukaryotic hosts, and especially in yeast, 
insect and mammalian cells. The introduced sequence is incorporated into 
a plasmid or vector capable of either autonomous replication or integrative 
activity. 

The sequence of c-Myc is known (Battey, l.etal.. Cell 54:779-787 
(1983)) and probes which are c^ble of identifying a c-Myc clone are 
commercially available (New England Nuclear/DuPont Biotechnology 
Boston, MA). 

The DNA sequence of the desired gene may be chemically 
constructed if it is not desired to utilize a done of the genome or mRNA as 
the source of the genetic information. Methods of chemically synthesizing 
DNA are well known in the ait (Oligomicleotide Synthesis, A Praaical 
Approach, MJ. Gail, cd., IRL Press, Washington, D.C., 1094; Synthesis 
and Applications of DNA andRNA, S.A. Narang, ed., Academic Press, 
San Diego, CA, 1987). Because the genetic code is degenerate, more than 
one codon may be used to construct the DNA sequence encoding a 
particular amino acid (Watson, J.D., In: Molectdar Biology of the Gene, 
3rd edition, W.A. Benjamin, Inc., Menio Park, CA, 1977, pp. 356-357). 
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To express the recombinant constructs of the invention, 
transcriptional and translational signals recognizable by the host are 
necessary. A cloned protein encoding DNA sequence, obtained through the 
methods described above, (prtfe^ly in a double-stranded form), may be 
operably*linked to sequences controlling transcriptional expression in an 
expression vector, and introduced, for exan^le by transformation, into a 
host cell to produce recombinant proteins useful in the methods of the 
invention, or functional derivatives thereof. Such techniques are well 
known in the art {Recombinam DNA Methodology, Wu, R. et al., eds., 
Academic Press, (1989); Maniatis, T.aal., Molecular Cloning (A 
Laboraiory Manual), second edition, Cold Spring Harbor Laboratory, 
1989). 

Transcriptional initiation regulatory signals can be selected which 
allow for repression or activation of the expression of the c-Myc construct 
or construct of the recombinant d complex peptide (or the C2 ' pq)tide), 
or both, so that expression of such constructs can be modulated, if desired. 
Of interest are regulatory signals which arc temperature-sensitive so that by 
varying the temperature, expression can be repressed or initiated, or are 
subject to chemical regulation, for example, by a metaboUte, salt, or 
substrate added to the growth medium. 

Where the native expression control sequences signals do not 
function satisfactorily in the host cell, then sequences functional in the host 
cell may be substituted. 

Expression of the constructs of the invention in different hosts may 
result in different post-translational modifications which may alter the 
properties of the proteins expressed by these constructs. It is necessa r y to 
express the proteins in a host wherein the ability of the protein to retain its 
biological function is not hindered. Expression of proteins in yeast hosts is 
preferably achieved using yeast regulatory signals. The vectors of the 
invention may contain operably-linked regulatory elements such as 
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upstream activator sequences in yeast, or DNA elements which confer 
spedes, tissue or ceU-type specific expression on an operably-Iinked gene. 

In general, expression vectors containing transcriptional regulatory 
sequences, such as promoter sequences and transaction tenninaiion 

5 sequences, arc used in connection with a host. These sequences fedUtate 
the efficient transcrqition of the gene ftagment opoably-linkcd to them. 
In addition, cxpresaon vectors also QrjncaUy contain discrete DNA 
elements such as, for example, (a) an origin of xqilicalion which aUows for 
autonomous rq)Ucation of the vector, or, elements which promote insertion 

10 of the vector into the host's chromosome in a stable manner, and (b) 
^edfic genes which arc capable of providing phenotyiric selection in 
transformed cells. Eukaryotic expression vectors may also contain elements 
which allow it to be maintained in prokaryotic hosts; such vector arc 
known as shuttle vectors. 

15 The precise naturc of the regulatory regions needed for gene 

expression will vary between species or ceU types and there are many 
q)pnjpriate expression vector systems that arc commercially available. 

In a highly prcfened embodiment, yeast are used as the host cells. 
The elements necessary for transcriptional expression of a gene in yeast 

20 have been recently reviewed (Struhl, K. Ann. Rev. Biochan. 55:1051-1077 
(1989)). In yeast, most promoters contain three basic DNA elements: (1) 
an upstream activator sequence (UAS); (2) a TATA dement; and, (3) an 
imtiation (I) dement Some promoters also contain operator dements. 
Methods in yeast genetics are wdl known (Struhl, K. Namre 505:391-397 

25 (1983); Sherman, et al. Methods in Yeast Genetics, Cold Spring Harbor 

Laboratory (1983)). 

In another embodiment, mammalian cdls are used as the host cdls. 
A wide variety of transcriptional and translational regulatory signals can be 
derived for expression of proteins in mammalian cdls and especially from 
30 the genomic sequences of viruses which infect eukaryotic cells. 
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Once the vector or DNA sequence containing the construct(s) is 
prepared for expression, the DNA construct(s) is introduced into an 
appropriate host cell by any of a variety of suitable means, for example by 
transformation. After the introduction of the vector, recipient cells are 

5 grown in a selective medium, which selects for the growth of vector- 
containing cells. Expression of the cloned gene sequcnce(s) results in the 
production of the protein. Tliis expression can take place in a continuous 
manner in the transformed cells, or in a controlled maimer. 

Genetically stable transformants may be constructed with episomal 

10 vector systems, or with integrated vector systems whenAy the fusion 

protdn DNA is integrated into the host chromosome. Such integration may 
occur de novo within the cdl or be assisted by transformation with a vector 
which functionally inserts itself into the host chromosome, for example, 
with retroviral vectors, transposons or odier DNA elements which promote 

IS integration of DNA sequences in chromosomes. 

Cells which have been transformed with the DNA vectors of the 
invention are selected by also introducing one or more markers which allow 
for selection of host cells which contain the vector, for example, the 
marker may provide biocide resistance, e.g., resistance to antibiotics, or 

20 heavy metals, such as copper, or the like. 

The transformed host cell can be fermented or cultured according to 
means known in the art to achieve optimal cell growth, and also to achieve 
optimal expresaon of the cloned protdn sequence fragments. As described 
hereinbdow, a high level of recombinant protdn expression for the cloned 

25 sequences coding for .the proteins can be achieved according to a preferred 
procedure of this invention. 

The methods of the invention are not intended to be limited to c- 
Myc and possess utility for the characterization of inhibitors against any 
Myc protein, such as, for example, N-Myc and L-Myc. The C2 complex 

30 peptides of the invention may interact with more than one Myc protein and 
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the C2' complex peptides of the inventions may form as the result of the 
activity of more than one Myc protein. 

The foUowing examples forther describe the materials and methods 
used in carrying out the invention. Hie examples are not intended to limit 
5 the invention in any manner. 

TRAMPLES 

Example 1 
^f^tfT^U and Methods 

i-f ]] nmwth and M Y^ nv^T«mT«sion: Hie 5A ceU line was maintained in 

10 spinner culture tmder selection with 80 iM mediotrexate. Protein 

purification started wifli roughly 6 Uters of cdls at SxlO^/ml grown up 
witiiout sdection. Heat shock promoter induction was achieved by 
resuspension in preheated fresh media (43''C) at 1/3 the original volume. 
Cells were incubated with stilling at 43-C for Ih. To aUow translation of 

15 tiie accumulated mRNA, ceUs were transferred to 3TC culttirc conditions 
for3h. Cells were then subjected to the purification described below. 

The baculovirus overexpression vector was constructed by insertion 
of tiie BamHl/BcU fragment of pGEMMycB [Halazonefis and KandU, 
Proc Natl Acad. Sci. USA 88:6162^166 (1991)] into the BamHl site of a 

20 baculovirus expression vector, pVL941, obtained from the laboratory of 
Max Summers (IcxasA&M University, College Station, Texas). The 
resulting plasmid contained the entire coding sequence of the human Myc 
gene including 6 nucleotides 5' of tiie initiation codon and 3' untranslated 
sequence extending to tiie genomic Rsal site, Sf9 cdls were grown and 

25 infected with recombinant baculovirus according to ttie metiiods of 

Summers [Summers and Smitii. A Manual of Methods for Baculovirus 
VeaoTS and Insect Cell Culture Procedures, Texas Agricultural Experiment 
.Station bulletin No. 1555J witii minor changes. Cells were passaEed in 
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spinner culture and plated on 150 mm diameter tissue culture plates for 
protein production. Cells were infected and harvested approximately 48 h 
post infection by sending. Cells were then washed in PBS and subjected 
to the purification described below. 

The Protein A<-Myc fusion protein was expressed in the E. coli 
AR68 strain fiom a previously published pRITZT vector [Dang, C.V., 
AnaL Biochem. 77^:313-317 (1988)] which fused the Ig binding portion of 
protein A to either amino acids 353-439 or amino adds 372-439 of c-Myc. 
Growth and induction of the cells was as per Dang et ed, [Anal. Biochem. 
174:313-317 (1988)). 

Protein Purification : All purification steps were carried out on ice or with 
ice cold buffers unless otherwise stated. Cells may be used firesh or stored 
quick frozen in liquid nitrogen for larger batch prqnrations. 5A or Sf9 
cells were washed in phosphate-buffered saline (PBS) and resuspended at 
2.1x10' ceUs/ml in Low Salt Lysis Buffer (20 mM HEPES pH 6.8, 5 mM 
KCl, 5 mM MgClj, 0.5% NP40, 0.1 % Na-deoxychoUte, 1 fig/ml 
aprotinin, and 0.1 mM PMSF) (Evan and Hancock, Cell 45:253-261 
(1985)], After 10 min cells were subjected to 40 strokes in a Dounce 
homogenizer with a type A pestle. Nuclei were pelleted at lOOOxg, 5 min, 
4'C, washed once in 50 ml Low Salt Lysis Buffer, resuspended at 2.5x10^ 
nuclei/ml in Low Salt Lysis Buffer containing 50 iig/wl DNAse I and 
incubated at 4°C for 1 h. An equal volume of ice cold 2X High Salt 
Buffer (2x concentrations: 20 mM Tris, pH 7.4, 4 M NaQ, 1 mM MgCl2, 
and 0.1% NP40) [Evan and Hancock, Cell 43:253-261 (1985)] was then 
added, mixed gently, and incubated for 10 min. Hie residual nuclear 
material Cmcluding the c-Myc protein) was pelleted (2000xg, 10 min, 4*C) 
and resuspended for solubilization at 5.5x10^ nucleus equivalents/ml in 
Buffer A (50 mM Tris, pH 8.0, 2 mM EDTA, 5 % glycerol, .1 mM DTT, 
and .1 mM PMSF) [Watt et al., Mol. Cell. Biol. 5:448-456 (1985)] 
containing 5 M urea (referred to as 5 M urea Buffer A) achieved by 
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dUution of a freshly deionized stock of 6 M urea. This and all buffers used 
on columns were passed through 0.2 pore fim filter units. Residual nuclei 
were solubilized by vigorous stirring on ice for 30 min. Hiis protein 
solution was centiifuged (10 min, SOOOxg, 4.Q to pellet any insoluble 

5 material prior to chromatography. The supanatant was loaded on a 10 ml 
DEAE Sephaiose CLr€B (Phaimada) column equilibrated with 5 column 
volumes of 5 M uiea Buffer A. Sample loading was at 0.1 ml/min and 
column washing and dution were at 0.4 ml/min. After loading, the 
column was washed with 3 volumes 5 M urea Buffer A containing no 

10 additional salt foUowed by 4 volumes of the same buffer containing 0. 1 M 
NaQ. Myc protein was duted in the following dution st^ at 0.35 M 
NaCL Hie protein containing fractions of this 0.35 M NaQ sxep were 
pooled and dfluted with ftcsh 5 M urea Buffer A to 0.1 M NaQ and loaded 
onto a 1 ml FPLC Mono-Q column (Phannada) run at 0.5 ml/min. The 

15 Mono-Q column was duted with a programmed gradient of 5 ml spanning 
0.10 M NaCl to 0.35 M NaQ foUowed by a 2 M NaQ stq>. For 
enhanced purity the gradient was hdd manuaUy at proximately 0.19 M 
until the major contaminating protein finished eluting as determined by an 
in line UV monitor. In the initial development of the purification protocol 

20 fractions from the columns were assayed for Myc by slot blotting foUowed 
by visualization using the 1F7 monoclonal antibody and ^I-labded 
secondary antibody. For later preparations sUver staining of SDS-PAGE 
aUowed suffidently unambiguous identification of the Myc proteins and 
provided an assessment of the purity of given fractions. The Myc 

25 containing fractions were pooled based on purity and dialyzedaganst 

buffer containing 20 mM Ttis, pH 7.8, 50 mM KCl, 10 % glycerol, 0.1 
mM DTT, and 0.1 mM PMSF (referred to as Dialyas Buffer) in bags of 
SpectroPor 2 membrane for 3 changes, 2 Uters each, for a minimum of 3 h 
each. Pools of fractions prepared this way contained CI and C2 (and C2') 

30 binding activities. To obtain pure CI binding activity the Myc-containing 
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Mono Q fractions were assayed by EMSA and those free of C2 binding 
activity were pooled and dialyzed squrately. 

The bacterially produced Protein A-c-Myc fusion protein was 
partially purified by differential centrifiigation and solubilized in 5 M urea 
according to Watt a al, [Bagchi et al., Mol. Cell. Biol. 7:4151-4158 
(1987)1 with the following minor modifications: Protease inhibitors were 
present in the initial lysis buffer (10 /ig/ml pepstatin, 1 mM PMSF, 50 
/ig/ml aprotinin, 2 ^g/ml leupq)tin, 10 mM Na-metabisulfite, and 1 mM 
benzamidine) and cells were sheared by 6 bursts of 15 s each in a Cuisinart 
MiniMate on ice. The urea solubilized material was cleared of insoluble 
material by centrifiigation (10,000xg, 10 min, 4*C) and dialyzed into 
Dialysis Buffer containing 0.5 mM DTT. Precipitated material was 
removed by centrifiigation (15,000xg, 20 min, 4«Q. Protein A-^Myc 
fiision protein was purified from the supernatant by IgG affinity essentially 
according to Nilsson et d. [EMBO J. 4:1075-1080 (1985)]. A 1 ml aUquot 
of supernatant was incubated with 0.1 ml of a 50% slurry of IgG Sepharose 
6 fast flow (Pharmacia) rocking for 1 h at 4»C. Tlie pellet was washed 
twice with Buffer A and the fusion protein eluted with 0.3 M lithium 
diiodosalicylate (LIS). TTie eluate was then dialyzed extensively to remove 
the US (initially against Buffer A at room temperature to avoid LIS 
precipitation, then against Dialysis Buffer 4'C). The two bacteriaUy 
etpressed Myc prqaradons were compared by Coomassie staining of SDS- 
PAGE to ensure that equal aniounts of the fusion proteins were used for 
experiments. 

fj.Tenninal Sequencing : The.3 bands of purified Myc from 5A cells were 
individually isolated by electioelution according to Hunkapiller a td. 
[Meth. Enz. PJ:227-236 (1983)]. Preparative SDS-PAGE was carried out 
and protein bands excised after visualization with Coomassie Brilliant Blue 
R-250. Alter electroelution the material was precipitated 2 times with 
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methanol/acetone and submitted for N-tenninal sequencing by Edman 
degradation. 

Antibodies : The monoclonal antibody, 1F7 (a generous gift of R. 
Caiizzonite, Hoffman LaRoche), is directed against tlie peptide sequence 
5 comprising amino adds 305-317 in murine c-Myc ^yamoto ex aL, Proc, 
Nad. Acad, Sd. USA 52:7232-7236 (1985)]. Tht antibody directed against 
cl was monoclonal 51F (Breyer and Sauer, J. Biol. Oiem. 2«: 13348- 
13354 (1989)] which had been purified by ammonium sulfate precipitation 
and chromatogra|)hy on QAE Sq)hadex. 

10 fi^i ^hnTerie M^ HjitY ^^^^ ^''"^ ^^A^' Radiolabeled probes were 
produced via a Klenow fill in of annealed oligonucleotides containing 4 
base 5' overhangs at each end (see table bdow fat sequences). Binding 
reactions took place in a final volume of 20 containing 2 ng of labeled 
probe, 125 ng poly d(IC), an indicated amount of proton, and tiie 

15 foUowing final buffer conditions: 10 mM Tris. pH 7.5, 50 mM KCl, 0. 1 
mMEDTA. ImMDTT, ImMMgQz and 5% glycerol. Bmding 
reactions were aUowed to proceed for 20 min at room temperature and 
were then loaded immediately on a 4% polyacrylamide gd which had been 
prerun at least 1 h at lOV/cm. Hectrophoresis was for 1.5 h at lOV/cm in 

20 0.5xTBE. 

riitandRenature: The method of Bagchi et d. (Bagchi et al, Mol CeU. 
Biol. 7:4151-4158 (1987)1 was followed excq»t for tiie final dialysis st^. 
Precq)itated protein samples containing BSA as carrier protein were 
solubilized in 6 M guanidine-hydrochloride (200 /il unless otherwise 
25 indicated) according to Bagchi ex al. [MoL Cell. Biol. 7:4151-4158 (1987)]. 
Directiy prior to analysis by EMSA tht samples were subjected to dialysis 
alone or in combination with another sample in a total volume of 15 id. 
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Equal volumes of each sample were used in a given experiment and the 
volume was brought to 15 ftl using 6 M GuHCl containing 0.1 mg/ml 
BSA. Dialysis was against 40 ml of Dialysis Buffw carried out for 1 h at 
4*C on floating 13 mm m«nbraije discs (Millipoie #VSWP-013, pore size 
5 0.025 fim). 

Site Selection f mm Random Seouences: The following procedure was 
devised based on the method of Pollock and Treisman [NucL Adds Res. 
15:6197-6204 (1990)]. A 52 base oligonucleotide "randomer" (see table 
below) was annealed to the following 16 base primer: Xho I primer 5' 

10 CCGATATCTCGAGACGG 3', [SEQ ID No. 4]. The annealed primer 
was extended using Klenow and nucleotides (0.2 mM cold dNTPs and 0.4 
piM o^^P-dCTP 800Ci/mmol) to create a pool of double stranded probes 
iq)resenting q)proxiniately 4^ sequences. TTie initial round of binding ate 
selection by EMSA utilized 200 ng of this pool and either 0.37 /eg of 

15 baculovirus produced c-Myc or 0.5 /xg of CHO produced c-Myc. Other 
parameters were as previously described for EMSA. Lanes containing 
randomer probes were alternated with reference lanes containing 2 ng 
(USE)3 probe and 0.37 ng of baculovirus c-Myc. The completed EMSA 
gel was dectroblotted onto NA45 membrane (200 mA, 2.5 hrs) and the wet 

20 membrane was wrapped in plastic vnap and exposed for at least 1.5 hrs. 

The regions of the randomer lanes corresponding to the visible CI and C2 
complexes of the reference lanes were exdsed and duted with 100 of 
dution solution (10 mM Tris, pH 8.0, 1 mM EDTA, 1 M NaQ) 30 min at 
68°C. The liqmd was transferred to a ftcsn tube and the membrane was 

25 rinsed with 100 /tl TE which was added to this duate. After pelleting the 
particulate debris, the DNA was prec^tated with the addition of 10 /xg 
glycogen, 2 /il 1 M MgCl2 and 2.5 volumes of ethanol. The pdlet was 
rinsed with 70% ethanol, dried, and the recovery assessed by scintillation 
counter. The entire pellet of each sample (-29-57 pg) was resuspended in 

30 10 /xl lOx PGR buffer (500 mM KCl, 100 mM Tris, pH 8.4, 1 mg/ml 
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gelatin, 15 mM MgQz) and 32 fi\ water. After addition of 1 /zl each of 
100 fiM Xho I primer and Xba I primer (5' GGACGATCTAGATTCG 3', 
[SEQ ID No. 5]), 5 fil of nucleotide mix (2 mM dNTPs and 4 fiM c?^?- 
dCIP SOOO/mmol), and 1 U Tajq polymerase the reactions were overlaid 
wifli paraffin oU and subjected to 20 cycles of PGR in an Ericomp 
machine: 2 min 94«C, 20x (15 sec 15 sec 55^Q, 10 min 72'C. 
The products were gd purified on 1096 acrylamide and precipitated using 
10 lie glycogen as canier. Recovery was measured by sdntillafion counter 
and after resuspension in the EMSA reaction buffer (10 mM Tris, pH 7.5, 
50 mM KQ, 1 mM EDTA, 1 mM MgQz, and 5% glycerol) tfiis probe 
was used for the next round of EMSA selection. Subsequent cydes were 
primarily as above, however, 50 ng of probe was used. Eight rounds of 
sdection and amplification were completed for tfie baculovirus c-Myc and 
seven rounds for tiie CHO c-Myc. After the final PGR reaction the 
products were extracted twice with phenol, twice with ether, and 
predpitated prior to digestion with Xho I and Xba I. After gd isolation the 
^jpropriate ftagment was subdoned into the Bluescript SK vector 
(Stratagene) and sequenced by standard procedures. 

ni fgnni t ^lfinrides Used : OUgonucleotide sequences that were used are 
shown bdow, witii the E-Box core sequences underiined: 

SEQ ID NO. 6: 

(^iE2)3 5' GATCTCTGCAGCAGSGGCAGC^SSgGCAGCAgSIgGCG 
SEQ ID NO: 7: 

(ME3)3 5' GATCTGCAGTC&JGTGGCGTCAIGIGGCGTCATGTGGCAG 



SEQ ID NO: 8: 
(USE) 3 5 



GATCTGCAGTCAOGTGGCGTCACGISGCGTS^CGTGGCAG 3 



SEQ ID NO. 9: 



ft IBCI III ITC CUCCT 
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MLC-A 5' TCGACGTCGCAGCAGGTGCAG 3'; 
SEQ 10 NO. 10: 

MLC-B 5' TCGACCCCA CCAGCTGG CGAG 3'; 

* 

SEQ ID NO. 11: 

5 ERPl/2 5' AGCm^CGA ACACCTG CA GCAGCTGG CAGGAAGCAGGCCTA 3' 

SEQ ID NO. 12: 

ERP3/4 5' AGCTTTAAAATCCCCAC CAGCTG GCGAAGCA ACAGGTG CA 3' 
SEQ ID NO. 13: 

HSE 5 ' AATTGCGAAACCCCTGGAATATTCCGACCTGGCAGCCTC 3 ' ; 

10 SEQ ID NO. 14: 

SMS 5' TCGACTTTAGACCACGTGGTCCCCTCGA 3'; 

SEQ ID NO. 15: 

Randomer 5' GGACGATCTAGATTCG(N)2oCCGTCTCGAGTATCGG 3'- 



15 Purification of c*Mvc Protein 

A primary goal of this work was to purify and characterize Myc 
from a mammalian source. An inducible mammalian overexpression 
system that has been described previously was utilized (Wurm a al.^ Proc. 
Noil Acad. ScL USA 35:5414-5418 (1986)). Briefly, the two coding exons 

20 of the mouse c-Myc gene imder the control of a Drosophila heat shock 
promoter had been integrated and amplified in the genome of a Chinese 
hamster ovary (CHO) cell line. This overexpressing cell line, 5A, was 
adapted to spinner culture. Heat shock (43 ""Q induces transcription of the 
amplified myc genes while a subsequent 2 hour recovery period at normal 

25 ^grewth tefnperature<37X)permics<^^ 
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phosphoproteins of 60, 62, and 72kD which were immunoprecipitable with 
Myc-spedfic monoclonal antibodies (Wum et al., Proc. Natl. Acad. ScL 
USA. 53:5414-5418 (1986)). The c-Myc produced was tighdy associated 
with the nuclei and attempts to solubilize it using a number of detergents, 
5 salts, and reducing agents woe unsuccessful (data not shown). Significant 
solubilization was achieved however with either SDS or with una at 
concentrations greater than 4 M. For purification, the Myc was solubilized 
with 5 M urea and chromatogtaphed on DEAE icsin and FPLC Mono-Q as 
described in materials and methods. The presence of Myc in the column 
10 ftactions was assayed by immunoblot using an antipqitide monoclonal 

antibody, 1F7 (Miyamoto et aL, Proc Natl. Acad. Sd. USA 52:7232-7236 
(1985)). Hiis purification procedure yielded 150/tg of c-Myc per Uter of 
spinner cells (8x10^ cells). The Myc appeared to be 95% homogeneous as 

judged by silver staining ^g. lA). 
15 An alternative translation start ate for c-Myc accounts for some of 

the molecular weight heterogeneity of c-Myc translated in vUro and 
expressed in several cell lines (Hann et al.. Cell 52:185-195 (1988)). THis 
alternate site is upstream from the canonical start site, however, and is not 
present in our overexpressor gene. N-terminal sequence analysis of each of 
20 the three prominent Myc bands described above revealed, as expected, the 
sequence predicted by the canonical start site (data not shown), although 
the N terminal methionine was not present, presumably because of N 
terminal processing. Therefore the potentiaUy important differences in 
apparent molecular wdght that are observed might be attributed to post- 
25 translational modifications and not N-terminal heterogeneity. 

Human c-Myc has also been purified uang the baculovirus 
overexpression system. For purification, S© cells that had been infected 
with recombinant virus were harvested just prior to the onset of lysis (-48 
hours post infection). Myc produced using the baculovirus system has been 
30 previously reported to be both phosphoiylated and tightiy associated with 
the nucleus (Miyamoto et al., Mol. Cell. Biol. 5:2860-2865 (1985)). 
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Solubilization and purification were carried out as with the CHO produced 
Myc resulting in a yield of 2.5 mg/8xlO^ cells, Myc purified from these 
insect cells was apparently homogeneous by silver staining, and ran as a 
single diffuse band of -60kD (Fig. IB). This was in contrast to the 
5 multiple bands observed with mammalian Myc by immunoblot (Fig, IB). 

Discussion 

Myc was purified to near homogeneity from overexpressing 
mammalian cells and baculovirus infected cells. The mammalian derived 
protein appears to be highly modified in contrast to Myc expressed in and 

10 purified from insect cells. Up to 19 distinct spedcs of c-Myc can be 

identified by two dimensional gel electrophoresis (Fig. 1). These species 
differ both in size (approximate MRs of 60,000, 62,000 and 72,000, 
although this estimate of dze can vary with different gel conditions) and in 
pi. These differences in pi might in part be attributed to differences in 

15 phosphorylation, as c-Myc is known to be phosphorylated and the change 
in pi of the species is consistent with incremental additions of phosphate. 
Although the Myc produced by the baculovirus overexpression system does 
, not demonstrate the same molecular weight heterogeneity as the mammalian 
protein, it too is phosphorylated (Miyamoto er a/., MoL Cell Biol 5:2860- 

20 2865 (1985)). The specific sites of phosphorylation have not been 
determined for dther Myc preparation and other as yet unidentified 
modifications may distinguish these two Myc prqarations. 

Example 3 

f fpedfic DNA Binding Activitv Present in Puri fied c-Mvc 

25 The presence of a B-HLH domain in c-Myc suggested that it would 

bind to an E-Box-like sequence of the general pattern CANNTG. These 
sites were first identified in immunoglobulin enhancers but have since been 
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found in many other tissue specific enhancers. It was first determined if 
any of these would be bound by the purified c-Myc protons described in 
Example 2. A large number of E box related sequences were screened by 
dectrophoretic mobiUty shift assays (EMSA). TTJOse shown in Fig. 2 

5 include synthetic oligonucleotides containing trimers of either the ;tE2 

(CAGCTC) or ftE3 (CATGTG) sites of the immunoglobulin enhancer and a 
trimer of the Adenovirus msgor late promoter upstream dement (USE) 
(CACGTG). Two sites from the myosin light chain (MLC) enhancer are 
also shown: the A site (CAGGTG) which resembles the kE2 

10 immunoglobulin enhancer site, and the B site (CAGCTG) which has the 
same core sequence as the /xE2 site. Hie heat shock dement (HSE) served 
as a control since its sequence does not resemble an E-Box core. 

TTiree specific binding activities were detected in this assay forming 
complexes referred to as CI (USE spedfic), C2 (USE spedfic). and C2' 

15 0xE2 spedfic). As demonstrated bdow, despite the comigration of C2 and 
C2', these iqwesent separate complexes based on observed differences in 
protein composition as wdl as binding specifidty. Hie data presented 
argue that the CI complexes are formed by homo-oligomers of Myc whUe 
formation of the C2 and C2' complexes each require an additional protein.- 

20 The slowly migrating complex (CI) formed most readUy on the USE (Fig. 
2, lanes 5, 11, and 12), less wdl on the similar mE3 site (Rg. 2, lanes 2 
and 8), and not at all on the other B-Box and non-E-Box sites tested. CHO 
and baculovirus Myc pr^arations were similar with regard to the CI 
complex, however they differed with regard to the fester migrating 

25 complexes, in the mammalian Myc assays the C2' complex formed on the 
;iE2 site of the immunoglobulin enhancer and the is /lEZ-like sequence of 
•the MLC-B site (Fig. 2, lanes 1 and 4). Baculovirus Myc contained no 
binding activity with this specifidty (Fig. 2. lanes 7 and 10). In contrast, 
formation of the C2 complex was detected using dther Myc preparation. 

30 The C2 complex formed most readUy on the USE site (Fig. 2, lanes 5, 11, 
and 12) and less weU on the similar itE3 sequence (Fig. 2, lanes 2 and 8). 
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Very little if any binding was detected on the kE2-like sequences (MLC-A 
Fig. 2, lanes 3 and 9, and fiES, data not shown). No specific binding was 
found on non-E-Box sequences such as the HSE (Fig.2, lane 6 and 13). 
Competition experiments were performed on the three binding 

5 activities CI, C2, and C2 ' to further characterize their specificity (data not 
shown). In experiments using /iE2, /xE3, USE, /iE5, or HSE sequences as 
competitors, competition of the C2' complex formed on the ftE2 probes 
was most «aly achieved with the ftE2 oligos while the C2 complexes were 
preferentially competed by the USE sequence. The CI complex was also 

10 competed most effidendy by the USE sequence. A detailed analysis of the 
binding specificities of these complexes is presented bdow. 

Example 4 

Pmteins Resoonrihle for For mation of CI. CI. and C2' Complexes 

One scenario suggested by the differences in binding is that Myc 
15 might not be the only protein involved in formation of the three complexes. 
To distinguish the role of c-Myc from other copurifying proteins in the 
formation of the observed complexes cut and renamre experiments were 
performed as follows. Prqarative amounts of Myc were sqxaiated by 
SDS-PAGE. Proteins were dectroeluted ftom various molecular weight 
20 slices, predpitated, solubilized in guanidine-hydrochloride and dialyzed to 
renature for analyas by EMSA. The CI complex binding activity may be 
icnatuied from the Myc containing slices of either baculovirxis or 
mammalian prqarations (Fig- 3) while no other slices from the entire gel 
contained CI actiivity (data not shown). These data argue that Myc alone is 
25 the proton responsible for tfie CI complex, and that full length Myc 

protein as acpressed in eukaryotic cells can bind specifically to sites with 
the core sequence CACGTG. 

Analysis of the proteins responsible for formation of the C2 and 
C2' complexes was achieved with additional cut and renature experiments 
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perfomed as described above. EMSA using the USE probe revealed no 
single sUce from CHO or baculoviius prqaiations which contained 
detectable C2 binding activity (data not shown). However, this activity 
was recovered by reaaturing proteins from a 26-29 kD slice together with 
proteins in the 60-70 kD Myc containing sUce (Fig. 4, lanes 1-8). Hie 26- 
29 kD component was present in gels loaded with either CHO or 
baculovirus produced c-Myc, and, when leaatured with Myc, demonstrated 
the same specificity as the C2 complex in the loaded materiaL Rcoaturation 
of the 26-29 kD sUce with BSA or protein A did not yield USE binding 
activity suggesting that Myc plays a specific role in the recovery of C2 
binding activity. 

To examine fiirther the roles of cqpuri^g proteins and of Myc 
modifications in the observed binding, Myc was also purified from a 
bacterial overexpression system. The expression system and purification 
method used were those of Chi Dang and cdlcagues (see materials and 
methods). The bacterially produced protein contains the IgG binding 
segment of protein A fused to the C-terminal 85 amino adds of Myc, the 
segment of Myc which contains the B-HLH and leucine zipper motifs. For 
many of the B-HLH proteins, the smaU region of the protein containing the 
B-HLH motif is not only necessary but My sufficient for DNA binding if 
the correct oUgomerizalion partoer is present. TTiis proton was able to 
form the CI complex on tlie USE probes (Fig. 4, lane 9) and to combine 
with the 26-29 kD factor to create the C2 complex (Fig. 4, lane 10). 
Competition experiments confirmed the specificity of this reconstituted C2 
complex. Tht CI and C2 complexes formed using this bacterial fusion 
protein migrated more r^idly than those formed using fiiU length c-Myc 
(compare Fig. 4, lane 8 with lanes 9 and 10). Hiis may be due to the 
difference in size between the full length c-Myc (60-72 kD) and the protein 
A-Myc fusion protein (-38 kD) and therefore the mobility of the C2 
complex may be interpreted as an indication that Myc is physicaUy present 



CMOCnniTF ^HEET 



-43- 



in the C2 complex, presumably as part of a heteroK)ligomer with the 26-29 
ID factor. 

Analogous experiments were carried out using a similar bacterial 
fiiflon protein containing only the C-terminal 67 amino acids of c-Myc. 
Hiis protein contains most of the HLH domain and the entire leucine zipper 
domain but no baric r^on. Although this protein is capable of forming 
homo-oligomers in solution (Gentz etaL, Science 243:1695-1699 (1989)), 
it was unable to bind to DNA to form the CI complex and was also unable 
to combine with the 26-29 kD factor to create any USE binding activity 
(Fig. 4, lane 12). These data argue that the role of Myc in the C2 hetero- 
oligomer requires an intact baric region, the region responrible for specific 
DNA contacts in other B-HLH proteins. 

Using cut and lenatuze experiments the ^£2 binding activlQr 
responrible for the C2' complex was able to be identified. A small amount 
of the C2' complex was firequently seen with proteins from the slice 
encompassing the 40-50 kD molecular weight range of mammalian Myc 
preparations (Fig. 5A). Although no C2' complex was ever seen with the 
Myc containing slice alone, renaturation of the protein from the Myc slice 
with the 40-50 kD slice reproducibly increased the amount of C2 ' complex 
formed. Both the baculovirus produced Myc and the bacterially expressed 
fiirion protein containing the baric region, which do not form complexes 
themselves on jxE2 probes, were also able to increase the amount of 
complex formed by the 40-50 kD slice obtained from mammalian 
preparations (Fig. 5B and Q. Surprisingly the bacterially produced Myc 
lacking the baric region could also reconstitute C2' activity, while various 
other protons tried including BSA, immunoglobulins, and protein A could 
not. The apparent lack of a role for this baric r^on suggests that Myc's 
involvement in formation of this complex may be other than contacting DNA. 

To further determine whether Myc was present in the analyzed 
complexes, the Myc preparations were incubated with a Myc-specific 
monoclonal antibody prior to EMSA. The probe used in this experiment 
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(SMS) contained a single site with the USE core sequence, CACGTG. The 
Myc-specific antibody eliminated both the CI and C2 complexes and 
produced a prominent complex of slower mobility (Fig. 6). It is not clear 
ftom these data which of the two complexes was supershifted but the 
5 piesenceof one predominant shifted complex when antibody is present and 

two complexes in the absence of antibody argues that the Myc-spedfic 
antibody also completdy disnqjted one of the ori^nal complexes. There 
was no effect of a control monoclonal antibody on the formation of either 
the CI or C2 complex. Hie Myc-spedfic antibody did not alter the C2' 

10 complex, suggesting that Myc is not present in this complex. 

From these experiments it can be conduded that the CI complex is 
formed by Myc alrae, that flie C2 complex contains Myc and a 26-29 led 
fector and that the C2 ' con^lex contains a 40-50 kd factor but does not 
contain Myc. It is intriguing that the C2 ' complex requires the presence of 

15 Myc for formation, but J5>parently does not contain Myc. Myc therefore 
appears capable of affecting the 40-50 kd fector's ability to form the C2 ' 
complex without being a membCT of the complex. WhatevCT the 
mechanism, the increase in mE2 binding activity of the 40-50 IsD factor 
appears to be Myc-specific since four different Myc proteins increased the. 

20 amount of C2 ' complex observed while several other proteins did not. 

Max protein can be immunopiecipitated firom avian and human cells 
and low stringency Southern analysis has suggested that a single Max gene 
or a small family of genes exist in other vertebrates as well (Blackwood 
and Eisenmann, Sdaux 25i:1211-1217 (1991)). It is possible that hamster 

25 -and insect cells have an equivalent of Max. The recovery of a Max-like 
activity from insect cells is particulacdy interesting since no Myc homologs 
have been found in insects to date. Drosophila dearly uses B-HLH 
heterodimers to regulate aspects of development and the possibility remains 
that the natural partner for the 26-29 kD protein in insect cells is an as yet 

30 unidentified B-HLH protein which functions like Myc. 
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The presence of the 26-29 kD factor in these preparations might 
limit their usefulness for certain experiments. By pooling Myc containing 
fractions based on an EMSA assay, one may obtain fractions that contain 
only the CI activity and that do not contain the C2 activity, although this 
5 modification reduces the final yield by a^roximately 80%. 

Example 5 

Selection of Binding Sites For Mvc From Random DNA Sequences 

In order to determine the optimal binding sites for the three 
complexes in the Myc prq>arations described above, a modification of a 

10 recently described technique for isolating preferred binding sites from large 
pools of randomized DNA sequences was used (Pollock and Treisman, 
NucL Acids Res. J8:6197-6204 (1990)), Briefly, a pool of double stranded 
oligonucleotides was created that consisted of 16 base flanking regions of 
defined sequence surrounding a 20 base region of completely random 

15 sequence. Each of the eukaryotic Myc preparations described above was 
mixed with this pool of sequences and the protein DNA complexes that 
formed were separated by EMSA. The DNA that ran at the position of the 
CI or C2 (and comigrating C2 ') complexes was isolated, amplified by the 
polymerase chain reaction (PGR), and used in a second round of EMSA 

20 selection. Either seven (CHO prq)aration) or eight. (baculovirus 

prq)aratibn) rounds of selection in total were performed before subcloning 
individual members of the selected sequences. As each round was expected 
to enrich for better binding sites, the final subcloned oligonucleotides were 
expected to contain high affinity binding sites for the CI, C2, and C2 ' 

25 complexes. In addition, such a procedure should give some indication of 
the relative stringency of selection for a given base at a particular position 
within the binding site consensus. 

The selected sequences were placed in three separate groups for 
analysis (Fig. 7). Group I contains sequences that were selected by the CI 
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complex from either mammalian or baculovirus preparations. These 
sequences were pooled for analysis because \with both prqrarations 
formation of the CI complex requires only Myc protein, and because the 
two sets of sequences (that isolated with mammalian Myc and that isolated 

5 with baculovirus Myc) were similar to each other. Most of the selected 
sequences in this group contained the sequence CACGTG (21/27 of 
sequenced subclones). By aligning aU of the sequences that contained this 
central core sequence, it was found that the sequences flanking this core 
were also nonrandom. A 12 base consensus sequence of 

10 GACCACGTGCTC [SEQ ID. No. 1] was determined for sites selected by 
the CI complex (see table in Fig. 7 for frequencies at each position; for a 
base to be included in the consensus it had to be found in at least 10 out of 
the 21 sequences with a CAC(3TG core). Hie C2 complex from 
baculovirus preparations selected sequences similar to those sdected by the 

15 CI complex (Fig. 7, (jToup n). Most of these sdected sequences also 
contained the CACGTG core (19/22). These sequences had similar 
flanking sequences adjacent to tht core hexamer to those found witii the CI 
complex, although there was a slight preference for GCC over CTC in the 
3 ' flank (see table for Group II in Fig. 7). 

20 As expected, complexes running at the position of C2 that were 

selected by the mammalian Myc preparations bad a greater diversity of 
sequences (Fig. 7, Group HI). Several sequences (8/36) contained the 
CACCrTG core. These sequences were presumably selected by the 
mammalian C2 complex (comprised of Myc and the 26-29 kd factor) and 

25 demonstrated the same flank preferences as the CI complex. Several other 
selected ates (9/36) contained a CAGCTG core sequence presumably 
selected by the C2' complex. In addition, 8 of the 36 sequences were very 
AT rich, and many of the sequences in all three groups contained AT rich 
stretches. This enrichment for AT rich sequences might reflect a 

30 preference of Myc for these sequences, or instead might simply indicate a 
bias arising from the protocol used. It is interesting to note, however, tiiat 
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in previous filter binding experiments, the mammalian Myc preparation has 
demonstrated a preference for binding AT rich sequences within various 
plasmids or lambda genomic DNA. 

To confirm the validity of our site selection procedure a number of 
5 the selected sites individually by EMSA (Fig. 8) were tested. As expected, 
it was found that sequences containing the core CACGTG formed both the 
CI and C2 complexes (Fig. 8, probe groups 1, 2, S, and 6) while 
sequences containing the CAGCTG core formed only the C2' complex 
(Fig. 8, probe groups 7 and 8). Note that the C2' activity is only present 
10 in the CHO derived Myc preparations. No complex formed when selected 
sequences that did not contain a canonical £ box core were tested (Fig. 8 
probe groups 3, 4, 9, 10, and 11). These latter sequences, therefore, do 
not represent high affinity sites for proteins in the Myc prq>arations. 

Example 6 

15 Off Rate Of The CI And CI Complexes 

Off-rates for the Myc containing complexes were measured as a 
means of comparing their affinities. The off-rate of the C2 complex 
formed on the USE probe was approximately 1-2 minutes (Fig. 9, 
baculovirus Myc; siniilar results were obtained with CHO Myc, data not 

20 shown). The CI complex was not fiiUy competed in this experiment using 
250 fold excess of USE competitor. Although competition was not 
complete, the amount of CI complex remaining at the earliest measurable 
timepoint (*0") was significantly less than the starting amount and virtually 
equal to the maximum competition achieved in these e9q)eriments. These 

25 data are indicative of an abundant weakly binding protein with an 
immeasurably fast off-rate. Therefore Myc alone appears to bind 
significantly more weakly than does Myc and the 26-29 kd factor. 
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Jdentificarion of an Inhibitor of c-Mvc 
d Complex Activity in Yeast Cells 

Yeast host cells arc tiansfonned with plasmids carrying a c-Myc 
expression vector (host 'a'); or the c-Myc expression vector and a 26-29 
5 niodalton C2 complex protein identified as above (host 'b'). In addition 
all yeast strains are cotransformed with a plasmid that contains the coding 
sequence for /5-galactosidase operably-linked to the CACGTG sequence 
motif as described above. 

A lawn of each of the transformed yeast strains is spread on agar 
10 plates txntaining X-gal in the medium and small filter disks containing 

compound W, X, Y, or Z are placed on the lawns. The yeast are allowed 
to grow and the plates are monitored for colony growth and colony color 
by visu2 1 observation. Typical results from such an experiment are shown 
in Table 1. 



wo 93/08701 



PCr/US92/08603 



-49- 



Table I: Identification of Inhibitors of C2 Complex Activity 



Compound Yeast Colony Color from 

5 Growfli /5-gal Assay 

withX-gal 



none a + White 

b + Blue 

10 W a + White 

b + White 

X a - - 

b 

Y a + White 

15 b + Blue 

Z a + Blue 

b + Blue 



The results of the above table indicate that compound W prevents 
20 the induction of /3-galactosidease in the 'b' host cells. Therefore, 

compound W is an inhibitor of C2 complex hetero-oligomer formation and 
an inhibitor of c-Myc biological activity. Compound X inhibits the 
giowtii of yeast per se and thus would not be a compound of interest. 

Compound Y does not prevent induction of /3.-galactosidase activity 
25 in die 'b* host cells. Therefore, compound Y is not an inhibitor of C2 
complex hetero-oligomer formation. 

Compound Z shows an interesting effect of inducing jS-galactosidase 
activity in the 'a' host cells which does not contain the C2 complex protein 
used in the 'b* hosts, rather than preventing hetero-oligomer formation. 
30 This suggests that compound Z may induce synthesis of a partner protein 
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which is not otherwise present in the yeast host cells or that it may be (or 
mimic) such a protein. 

From these results, compound W would be identified as an inhibitor 
of C2 complec formation and/or DNA binding and thus of c-Myc 
S ttansciiptional activi^ in nw. 



Example 8 

Trientification ^f m Tnhihitorof c-Mvc 
n' romplcT Activity in Yeast Cells 



Yeast host cells are transformed with two plasmids, each plasmid 
10 canying a C2 ' complex expression vector encoding at least one 40-50 
kilodalton C2' pq>tide (host 'a'); or the c-Myc expression vector in 
addition to the vectors encoding the C2' complex proteins identified as 
above (host 'b'). In addition all yeast strains are cotransformed with a 
plasmid that contains the coding sequence for /3-galactosidase operably- 
15 linlffid to the CAGCTG sequence motif as described above. 

A lawn of each of the transformed yeast strains is spread on agar 
plates containing X-gal in the medium and small filter disks containing 
compound W, X, Y, or Z are placed on the lawns. The yeast are allowed 
to grow and the plates are monitored for colony growth and colony color 
20 by visual observation, laical results from such an experiment are shown 
in Table 1. 
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Table 2: Identification of Inhibitors of C2 ' Complex Activity 



Compound Yeast 



Colony 
Growdi 



Color from 
/3-gal Assay 
withX-gal 



none 



a 
b 



+ 



White 
Blue 



10 



W 



a 
b 



+ 
+ 



White 
White 



a 
b 



15 



a 
b 



+ 
+ 



White 
Blue 



a 
b 



+ 
+ 



Blue 
Blue 



The results of the above table indicate that compound W prevents 
20 the induction of /S-gaiactosidase in the 'b' host cells. Therefore, compound 
W is an inhibitor of C2' complex hetero-oligomer formation and an 
inhibitor of the c-Myc biological activity that is directed towards promoting 
such C2 ' complex heteroH)Iigomer formation. Compound X inhibits the 
growth of yeast per se and thus would not be a compound of interest. 
25 Compound Y does not prevent induction of ^-galactosidase activity 

in the 'b' host cells. Therefore, compound Y is not an inhibitor of C2 
complex hetero-oligomer formation. 

Compound Z shows an interesting effect of inducing /3-gaiactosidase 
activity in the 'a' host cells which does not contain the Myc protein used 
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in the 'b' hosts, rather than preventing hetero-oligomer formation. This 
suggests that compound Z may induce synthesis of a protein that can 
substitute for Myc in promoting formation of the a' complex which is not 
otherwise present in the yeast hqst cells or that it may be (or mimic) such a 
protein. 

From these results, compound W would be identified as an inhibitor 
of C2' complex formation and/or DNa binding activity and thus of c-Myc 
transcriptional activity in vivo. 

All references dted herein are fiiUy incorporated by reference. 
Having now fiiUy described the invention, it wiU be understood by those 
with skiU in the art that the scope may be performed within a wide and 
equivalent range of conditions, parameters and the like, without affecting 
the spirit or scope of the invention or any embodiment thereof. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Kingston, Robert E 
Papoulas, Ophelia 

(ii) TITLE OF INVENTION: C-MYC DNA BINDING PARTNERS, 
MOTIFS, SCREENING ASSAYS, AND USES THEREOF 

(ill) KDHBER OF SEQUEMCEIS: 101 

(iv) CORRESPONDEKCC ADDRESS: 

(A) ADDRESSEE: Sterne, Kessler, Goldstein & Fox 

(B) STREET: 1225 Connecticut Avenue, N.W,, Suite 300 

(C) cm: Kashingrton 

(D) STATE: DC 
<E) COUNTRY: USA 
(F) ZIP: 20036 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version /1,25 

(vi) CURRENT APPLICATION DATA: 

(A) APFUCATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INPORMATIOKi 

(A) RAKE: Clfflbala, Kichele A 

(B) REGISTKATION NUMBER: 33,851 

(C) REFERENCE /DOCKET NUMBER: 0609.3440004 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (202)833-7533 

(B) TELEFAX: (202)833-8716 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
GACCACGTCC TC 12 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS; single 

(D) TOPOLOGY I linear 

(ii) MOLECULE TYPE: ONA 



(xi) SEQUENCE DESCRIPTIOH: SEQ ID NO: 2: 
CACCACGTGG TC 12 
(2) INFORMATION FOR SEQ ID NO: 3: 

(1) SEQUENCE CHARACZtRISTICS: 

(A) LENGTH: 12 base pairs 

(B) TTPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOCrx linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AACAGTSTCTG TT 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CBARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 
{C} STRAMDEDNESSt single 
(D) TOPOIOGT: linear 

(ii) MOLECULE TTFE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CCGATATCTC CAGACCG 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(xi) SEQUENCE DESCRIPTION: SEQ ID-NO:S: 
CGACCATCTA CATTOG 
(2) INFORMATION TOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GATCTCTGCA CCAGCTCGCA GCAGCTGGCA CCACCTGGCG 40 
(2) IKFORMATIOM FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 baee pairs 

(B) TTPE: nucleic acid 

(C) STRANDEDNESS: aingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TTPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CATCTGCAGT CATCTCCCCT CATCTGGCGT CATGTCGCAG 40 
(2) INPORMATION FOR SEQ ZD NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 baee palrfi 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CATCTGCAGT CACGTCGCGT CACGTGGCGT CACGTGGCAG 40 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TOGAOGTCGC AGCAGGTGCA G 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOI-OGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

21 

TCCACCCCAC CAGCTCGCGA C 

(2) INFORHATION PGR SEQ ID KO:ll: 

(1) SEQUENCE CPARACTERISTICS: 

(A) I£RGTH2 40 base pairs 

(B) TYPES nucleic acid 

(C) STRAHDEDNESS: single 

(D) TOPOLOGYs linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

40 

AGCTTCCAAC ACCTGCAGCA CCTCCCAGCA ACCAGCCCTA 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) ICTCTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) SXRAKDEDNESSs single 

(D) TOPOLOGYs linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AGCTTTAAAA TCCCCACCAG CTCGCGAAGC AACAGGTGCA ^0 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSs single 

(D) TOPOLOGYs linear 

(ii) MOLECUI£ TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AATTCCGAAA CCCCTGCAAT ATTCCGACCT CGCAGCCTC 39 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base paxrs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TCGACTTTAC ACCACGTCCT CCCCTCCA 28 
(2) IHFORMATION FOR SEQ ID K0:1S: 

4 

(i) SEQUENCE CHARACTERISTXCS: 

(A) LENGTH: 52 base pairs 

(B) TYPES nucleic acid 

(C) STRANDEDNESSi single 

(D) TOPOXXXSYx linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CGAOGATCTA GATTOGNNNN NNNNNNNNNN NNNNNNCCGT CTOGACTATC GG 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) XiSNGTHs 10 base pairs 

(B) TYFE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
NNCANNTGNN 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) . STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GCAGAATCTA CCAOGT G CT C C 21 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) HOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CCGGCTACCA CGTGCrTATG 
(2) IHFORMATIOH FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TSfFEs nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGYt linear 

(11) KOLECOLE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

20 

GGACGAAAGC ACGTGCTCCG 

(2) INFORMATION FOR SEQ ID N0:20: 

(I) SEQUENCE CBARACTERISTICS: 

(A) LENGTHS 20 base pairs 

(B) TTPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGrs linear 

(11) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

20 

GCACATGACC ACGTGCTCTG 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TTPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(II) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21: 

on 

CGCAGAGACA CGTGCCCTGG 

(2) INFORMATION FOR SEQ ID Nd:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GGCAAACCAC GTGTTATGTG 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
(6) TTPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOCY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
OGAOCACGTG CTCTTOGACT TG 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 baee pairs 

(B) T^TPE: nucleic acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
GCACAATTTG TACCACGTGG CCG 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GGACAACAIC CACCAOGTCG COG 23 
(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTIOK: SEQ ID N0:262 
CCCTCCATCA CCACCTGGAC C 21 
(2) IHFORMATIOK FOR SEQ ID KO:27: 

(i) SEQUENCE CHARACTERISTICS £ 

(A) LENGTH: 21 base pairs 

(B) mZx nucleic acid 

(C) SXRANDEDNESSx single 

(D) TOPOLOGYs linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GCAAAXATGA CCAOGTGGTA C 

(2) IKPORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 
{K) LENGTHS 20 base pairs 

(B) T^TFEs nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGr: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:28: 
CGACCACGTG CTCTTTTGTG 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

- (ii) MOLECULE XyPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GGCATAAACT CCACGTGGTC C 
(2) INFORMATION FOR SEQ ID N0:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 
CCGGCACCTG CTCCTCGGAC TC 22 
(2) INFORMATION FOR SEQ ID,N0:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 
GGTAGCAAAA AGCACGTGCC CC 22 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHAKACTCRISTICS: 

(A) LENGTHS 22 base pairs 

(B) TYPES nucleic acid 
<C} STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GGGGCATTTA AGCACGTGCT CC 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

CACCTATTAA CCACGTGGTA C 21 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base paire 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESSi single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

24 

CACCAOCCGG CATCCAOCTG CCCT 

(2) INFORMATION FOR SEQ ID NOs35: 

(I) SEQUENCE CHRRACTERISTICS: 

(A) LENGTH: 20 base paira 

(B) TTPEs nucleic acid 

(C) 6TRAHDEDHESS: Bingle 

(D) TOFOLOGT: linear 

(11) MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

20 

CGCGACCACG TCCrCCGTTG 

(2) INFORMATION FOR SEQ ID NO:36: 

(I) SEQUENCE GBARACTEHJSXICSt 

(A) LENGTH: 22 baac pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

. (11) MOLECULE TYPE: DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

22 

CACATATTAG ACCACGTGCT CC 

(2) INFORMATION FOR SEQ ID NO: 37: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(Dl TOPOLOGY: linear 

(II) MOLECULE TYPE: DNA 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

23 

CCGCCACGTC CTCACTCTCT ACC 

(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base paxrs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
CCATCGACAG CTTCTTCCTG 
(2) INFOBMATION FOR SEQ lb NO: 39: 

(i) SEQUENCE CBABACTERISTICS: 

(A) LENGTB: 20 base pairs 

(B) TTPE: nucleic acid 

(C) STRAHDCDNESSs single 

(D) TOPOLOGY^ linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 
GCAATCCCCC GCTGCTOGCC 
(2) INFORMATION FOR SEQ ID NOt40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTB z 22 base pairs 

(B) TYPES nucleic acid 

(C) . STRANDEDNESS 2 single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:40: 
GCCAAAAATG TACAGCTGTC CC 
(2) INFORMATION FOR SEQ ID N0:41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 

CCGCCAOGAG CTCATCAATG TCC 

(2) INFORMATION FOR SEQ ID NO:42: 

(t) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 baee pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DKA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
CCXGCCTCTA CCTGACTTGC 
(2) INFORMATION FOR SEQ ID kO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IfiNCTHs 20 base pairs 

(B) TTPE: nacleic acid 

(C) STRAHDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
CCGCAGTCCT GGTGCTCTCC 
(2) INFORMATION FOR SEQ ID NOt44s 

(1) SEQUENCE CBARACTCRISTICS: 

(A) I£NGTHs 23 base pairs 

(B) TXPEs nucleic acid 

(C) STRAKDEDNESS2 single 

(D) TOPOLOCy: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:44: 
CACTAAGAAA TACCACCTGG CCC 
(2) INFORMATION FOR SEQ ID NO:45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSs single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

CGCGAITTAA CCACCTGCTC C 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
CGCCCACCTG CCTTCTTTCT COG 
(2) INFORMATION FOR SEQ 10 110x47: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPES nacleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
CATAGTCCAC ACAGCACCTG CCAA 
<2) INFORMATION FOR SEQ ID NO: 48: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) .STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
CATAAGTCAG ACCACGTGGC CG 
(2) INFORMATION FOR SEQ ID NO: 49: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

OCCAACTAAG ACCACGTGGC CG 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
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(B) TCPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLCXJy: linear 

(ii) MOLECULE TYPEr DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
CAGTCGAAGA CCCCACGTGG CGA 23 
(2) INFORMATION FOR SEQ ID NO: 51; 

(i) SEQUENCE CHARACTERISTICS: 

(A) I£NGTBs 23 base pairs 

(B) TTFE: nucleic acid 

(C) STRANDEDNESS: single 

(D) WPOLOGYs linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
CCTAGGTTAT TCCCAOGTGG COG 
(2) INFORMATION FOR SEQ ID NO:S2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I£NGTH: 21 base pairs 

(B) TYPEt nucleic acid 

(C) STRANDEDNESS:. single 
{D) TOPOLOCyx linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
CATAAATAGG CCACGTGCTC C 
(2) INFORMATION FOR SEQ ID NO:53: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 21 base pairs 

(B) TSTPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY s linear 

(ii) MOLECULE TYPE; DNA 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO; 53: 

GGAAAATGTA CCACGTGCTC C 21 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 20 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
CGAACAGACC ACGTGGCTTG 20 
(2) INFORMATION FOR SEQ ID' NO:SS: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAXn)£DNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CTACCAOGTG CTTTTTTGGC 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHAKACTERISTICS: 

(A) I£NGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: -single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 56: 
CAGTCCGAGG AGCACGTGCC CG 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

CCGCCAOGTG TOGAGCATGA GTC 23 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 



^IIR^Tm ITP SHEET 



wo 93/08701 



- 68 - 



PCr/US92/08603 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

23 

CCGCCACCTG CTCGTAAATT TGC 

(2) INFORMATION FOR SEQ ID NOsS9: 

(t) SEQOENCE CHARACTERISTICS: 

(A) LENGTHS 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:S9: 

23 

CCGACAAAAT TACCACCTCG CCG 

(2) INFORMATION FOR SEQ ID NO:60: 

(i) SEQOENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:60: 
CGCAAAATCG ACCACGTGGT CC 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic -acid 

(C) STRANDE02IESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION t SEQ ID NO: 61: 

23 

CCATAAGTAA TACCACGTGG CCG 
(2) INFORMATION FOR SEQ ID NO: 62: 
(i) SEQUENCE CHARACTERISTICS: 



22 
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(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 



(Xi) SEQXJENCE DESCRIPTION: SEQ ID NO: 62: 
GCAAAAAAAC CACCTCGTCC 20 
(2) IKFORKATION FOR SEQ ID NO: 63: 

(i) SEQUEKCS CBARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TTFE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

GCCCCCGGAA CTCC G TT G TC 20 

(2) INFORMATION FOR SEQ ID NO: 64: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(6) TYPE: nacleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
GGGGACCCGA TCTCTCGCTG 20 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
CAATAATATT TCCTTTCCTG 20 
(2) INFORMATION FOR SEQ ID NO: 66: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STKANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
CTCCACGOGC CATCCACGTG CCGT 
(2) INFOIWATION FOR SEQ ID KO:67: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 23 ba«c pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY r linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

23 

CGCCCAOCTG CTCTATACAT GCC 

(2) INFORMATION FOR SEQ ID NO:68: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 20 base pairs 

(B) TYPES nucleic -ecid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

20 

GGACCACGTG CTTATCTTTG 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) ' TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
CCACCACGTG TTCCGCTACT CG 
(2) IHFOPMATION FOR SEQ ID N0:70: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLCXSY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

06AGTAGCGA GCACGTGTTG C 

(2) INTORMATION FOR SEQ ID NO:71: 

(i) SEQUENCE CEARACTERISTICS: 
(X) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
CCACCACGTG CTTACCATGT C 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucloic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
CGACAAAAAG CACGTCCTAC 
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73:, 
GCAAAACTCC AOGTGGTCGG 
(2) INFORMATION FOR SEQ ID NO:74: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 22 base pairs 

(B) TVPE: nucleic acid 

(C) STRANDEDNESSs single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUEKCE DESCRIPTION: SEQ ID NO:74: 
CGCCAAAAAC AACAGCTCTO CG 
(2) INFORKMIOM FOR SEQ ±0 KO:7S: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I£NGTH: 21 base pairs 

(B) TYPE: irocleic acid 

(C) STRANDEDKESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CGCAAACAGA TCACCTGTCC G 21 
(2) INFORMATION FOR SEQ ID NO: 76: 

(1) SEQOENCE CEARACTERISTICS: 

(A) £ENGTB: 21 base pairs 

(B) -TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

-21 

GGAGAATTGA ACAGCTGACC C 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

Jxi) SEQUENCE DESCRIPTIONS SEQ ID NO: 77: 
CGGACAAACC AGTCAGCTGG CCG 
(2) INFORMATION FOR SEQ ID NO: 78: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: 
GGGCACAGCT GTTTAGTGGG 
(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pair 8 

(B) TTFE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
CGCAAGOGGA CAGCTGTTCC 
(2) INPORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
CGCATTGATC AGCTGTGTGG 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION; SEQ ID H0:81: 
GCAAAAACCA CCTGGTCCCC 
(2) INFORMATION FOR SEQ ID NO: 82: 
(i) SEQUENCE CHARACTERISTICS: 



CI IQCTTTI ITC CUCPr 



wo 93/08701 



PCr/US92/08603 

- 74 - 



(A) LENGTH: 21 base pairs 

(B) TYPEt nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIjOGY: linear 

(ii} MOLECULE TYPE: DNA 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
CGChKQXGXh ACAGCT G GI G C 
(2) INFORKATIOK FOR SEQ ID HO: 83: 

(I) SEQUENCE CBAKACTERISTICS: 

(A) I£NCIH: 20 base pairs 

(B) TTFEs tmcleic acid 

(C) STRANDEDNESS: single 

(D) TOPCrXXTf: linear 

(11) MOLECULE TYTO: DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:83: 

20 

CCATCGTrTT TTTTTTCTAC 

(2) INFORMATION FOR SEQ ID NO:84: 

(1) SEQUENCE CHARACTERISTICS s 

(A) LENGTH: 20 baae pairs 

(B) TYPEx nucleic acid 

(C) STRARDEDNESSs single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

-20 

GCATGATTTT CTTTTTGTCC 

(2) INFORMATION FOR SEQ ID NO: 85: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) .TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(II) MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID K0j85: 

20 

CAGACTTTTT TTCAGCCCCC 

(2) INFORMATION FOR SEQ ID NO: 86: 

(1) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
CCAAAAAATA AAAATACATC 
(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

20 

GGCAAAAAAG TCAAAATACG 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(6) TYPE: nucleic acid 

(C) STRANDEDNESS t single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 
GCACAATAAA AAACTTTGCG 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89: 
CCATATGTTC ATTGTTGTCC 
(2) INFORMATION FOR SEQ ID NO: 90: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:90: 
GACAAAAATT TAGTGTGTGC 
(2) INFORMATION FOR SEQ ID' N0:91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I2NGTHX 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS s single 

(D) TOPOLOGYs linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
CGGCCCOGTG CTCTAGCCCA TGC 
(2) INFORMATION FOR SEQ ID NO:92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:92: 
CGGGGAAGTC CCAAGTGCCC C 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic -acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
CACAGGAACA TACACGGGCC CG 
(2) INFORliATION FOR SEQ ID NO: 94: 
(i) SEQUENCE CHARACTERISTICS: 



r»i mf t i ll ITC CUCCT 



wo 93/08701 



- 77 - 



PCT/US92/08603 



(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:94: 
CGCACGGGAT CATTGACCTG COGT 24 
(2) INFORMATION FOR SEQ 10 NOs95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPES nuclsic acid 

(C) STRANDEDNESS: single 

(D) TOPOIXX5YS linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9S: 
OCCAACOGAC GTCACTCCTG 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENCTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
CACCTACCAC TGATCGCGGC 
(2) INFORMATION FOR SEQ ID NO:97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:97t 
CGACAAACAT CCCATTACCC 20 
(2) INFORMATION FOR SEQ ID NO:98: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 base pairs 

(B) TYPEr nucleic acid 

(C) STRAKDEDNESS: single 

(D) TOPOIXXJY: linear 

(ii) MOLECOI.E TTPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:98: 
GGGGATGGAA CATCGOGCTC 
(2) IHFORHATIOH FOR SEQ ID KO:99: 

(i) SEQUENCE CEARACXEKISTICS: 

(A) LENGTHS 20 base pairs 

(B) nPE: nucleic acid 

(C) STRAKDEDNESS s single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TTPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:99: 
CCAGTCGGGC CTAACCGGCC 
(2) IRFORMMION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
GGGAGCCATC CACGCCGGTG 20 
(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
CCATAGGGGA CTTGACAGCC 20 
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WHAT IS CLAIMED IS: 

1. A method for the purification of Myc from a mammalian 
source, wherein said method cpmpiiscs: 

(a) growing mammalian cells capable of expressing c-Myc; 

(b) indudng c-Myc eaqnesdon in said cells; 

(c) lyang Hit membrane of said mammalian cells and purifying 
nucld therefrom; 

(d) treating said nuclei in a buffer comprising DNase I; 

(e) solubiliiing said nuclei in a buffer comprising sodium 
dodecyl sulfiale or urea at concentrations greater than 4 M 
and sqjarating the nuclear pellet from the supernatant 
fraction; 

(f) applying said suDcmatant fraction of step (e) to a DEAE 
Scpnarose CL^ column and eluting bound c-Myc from said 
DEAE Sqpharose CL-6B column with a salt gradient; 

(g) applying said c-Myc of step (f) to a FPLC Mono-Q column 
and eluting bound c-Myc with a salt gradient. 

2. A method for the detection of CI complexes in a sample, 
wherein said niethod comprises detecting DNA binding of c-Myc- 
containing homo^ligomers to the DNA motif 5'-CACGTG-3\ in its double 
stranded DNA form. 

3. A method for the detection of C2 complexes in a sample, 
wherein said method comprises detecting DNA binding of c-Myc- 
containing hetero-oligomcrs to tiie DNA motif 5'-CACGTG-3\ in its 
double stranded DNA form. 
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4. A method for the detection of C2 ' complexes in a sample, 
wheidn said method comprises detecting c-Myc directed DNA binding to 
the DNA motif 5'-CAGCTG-3', in its double stranded DNA fom. 

5. A piotdn composition comprising at least one pq)tide 

5 oqyable of forming a d complex, wherein said pqitide actable of forming 
a C2 complex is found in a 26-29 Id) protein fraction puziiied from 
Chinese hamster ovary cells or baculovirus. 

6. The protein composition of daim 5, wherein said protein 
composition is prepared from Chinese hamster ovary cells by a method 

10 comprising the st^s of: 

(a) growing said cells; 

(b) Ivsins the membrane of said cells and puri^g nudd 
tbereftom; 

(c) treating said nudd in a buffer compri^g DNase I; 
15 (d) solubilizing said nudd in a buffer comprising sodium 

dodecyl sulfete or urea at concentrations greater than 4 M 
and separating the nuclear pellet from the supernatant 
fraction;, ' 

(e) applying said supernatant fraction of step (e) to a DEAE 
20 Sq>harose CLr6& column and tiie bound C2 complex protdn 

from said DEAE Sepharose CL-6B column with a salt 
gradient; and 

(g) flying the duted C2 complex protein of stq) (f) to a FPLC 
Mono-Q column and duting bound C2 complex protdn with 
25 a salt gradient 

7. The protein composition of claim 5, wherein said protein 
composition is prepared from baculovirus. 
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8. A protein composition comprising at least one peptide 
capable of forming a C2 ' complex in the presence of c-Myc, wherein said 
pq)tide capable of forming a C2 ' complex in the presence of c-Myc is 
found in a 40-50 kD protdn fraction purified from CHO cells. 

9. The protein composition of claim 8, wherein said protein 
composition is prepared from Chinese hamster ovary ccUs by a method 
comprising the sbsps of: 

(a) growing said cells; 

(b) lysing the membrane of said cells and purifying nuclei 
therefrom; 

(c) treating said nucld in a buffer comprising DNase I; 

(d) solubiliang said nucld in a buffer comprising sodium 
dodecyl sulfate or urea at concentradpns greater than 4 M 
and separating the nuclear pdlet from the supernatant 
fraction; 

(e) applying said supernatant fraction of step (e) to a DEAE 
Scpharose CLr€B colunm and the bound C2 ' complex 
protein from said DEAE Sepharose CL-6B column with a 
salt gradient; and 

(g) applying the duted C2 ' complex protein of step (0 to a 
FPLC Mono-Q column and eluting bound C2 ' complex 
protein with a salt gradient. 

10. A method for objectively classifying compounds, including 
human pharmaceuticals, as iiihibitors of c-Myc activity, wherein said 

25 mediod comprises detecting the ability of said compoimd to inhibit CI 
complex formation, CI complex formation or C2' complex formation. 

11. The method of claim 10, wherein said complex formation is 
CI complex formation. 



10 
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-Si- 
ll. The method of claim 10, wherein said complex fonnation is 
C2 complex fonnation. 

13. The method of claim 10, wheidn said complex formation is 
C2' complex formation. 

5 14. A method for objectively dasa^ confounds, including 

human pharmaceuticals, as inhibitors of c-Myc activity, wherein said 
method comprises detecting the abiUty of said compound to inhibit CI 
complex DNA binding, C2 complex DNA binding, or C2 ' complex DNA ^ 
binding. 

Xo 15. The method of daim 14, wherdn said DNA binding is CI 

conqilex DNA binding. 

16. The method of claim 15, wherdn said DNA binding is to an 
oligonudeotide comprising the sequence 5'-CACGTG-3'. 

17. The method of claim 15, wherein said DNA binding is to an 
15 oligonudeotide comprising the sequence 5'-CATGTG-3'. 

18. The method of claim 14, wherein said DNA binding is C2 
complex DNA binding. 

19. The method of claim 18, wheirin said DNA binding is to an 
oUgonudeotide comprising the sequence 5'-CACGTG-3'. 



20 



20. The metiiod of claim 18, wherein said DNA binding is to an 
sligonudeotide comprising the sequence 5'-CATGTG-3' . 
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21. The method of claim 14, wherein said DNA binding is C2 ' 
complex DNA binding. 

22. The method of claim 21, wherein said DNA binding is to an 
oligonucleotide comprising the sequence 5'-CAGCTG-3'. 

5 23. A method for the purification of a peptide capable of forming 

a C2 or C2 ' complex, or a mixture of such pq)tides from a crude prepara- 
tion, wherein said method comprises extraction of Chinese hamster ovary 
cells and assay of said pq)tide by detection of the ability of said peptide to 
form said C2 or said C2' complex. 

10 24. A method for identifying and classifying a compound as an 

inhibitor of c-myc heteroK)ligomer DNA binding wherein said method 
comprises evaluating the ability of said compound to alter e^wcssion of a 
reporter gene in a host cell, wherein expresaon of said rqwrter gene is 
operably-linked to DNA binding by said hetero^ligomer to an 

15 oligonucleotide comprising the sequence 5'-CACGTG-3'. 

25. The method of claim 24, wherein expression of said reponer 
gene induces a phenotypic change in a host cell. 

26. The method of claim 24, wherein said reporter gene is larZ. 

27. The method of claim 24, wherein said reporter gene is.CAT. 

20 28. The method of claim 24, wherein said rqwrter gene is 

LEUl. 

29. The method of claim 24, wherein said phenotypic change is 
detected by visual inspection of the host cell. 
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30. The method of claim 24, wherein said host is 5. cerevisiae. 

31. Hie method of claim 24, wherein said host is a mammalian 



ceU. 



32. A method for ideating and classifying a compound as an 
inhftitor of c-MycKiiiected C2' hetenwligomer DNA binding wherein said 
method comprises evaluating the ability of said compound to alter 
expression of a rqwrter gene in a host ceU, wherein expression of said 
rqwrter gene is operably-linked to DNA binding by said hetero-oUgomer to 
an oligonucleotide comprising the sequence 5'-CAGCrG-3'. 

33. TTie method of daim 32, wherein expression of said reporter 
gene induces a phenotypic change in a host cdL 

34. The method of daim 32. wherein said rqwrter gene is lacZ. 

35. The method of claim 32, wherein said reporter gene is CAT. 

36. The method of claim 32, wherein said reporter gene is 



15 LEU2. 



37. The method of claim 32. wherein said phenotypic change is 
detected by visual inspection of the host cell. 

38. TTie method of claim 32, wherdn said host is S. cerevisiae. 

39. The method of claim 32, wherein said host is a mammalian 



20 cell. 
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BOX II. OBSERVATIONS WHERE UNITY OF INVEN HON WAS LACKING 
This ISA found multiple invenciofu as foUowt: 

This tppUcation conuins the following Lnventions or groups of inventions which arc not so linked as co form a 
single inventive concept. In order for tU mvenrions to be exAmined* the appropriate additional examination fees must 
be paid. 

Group I. claims i and 23, drawn to a method of purifying Myc, classified class 530, subclass 412 and 417. 
and claims 5-9. drawn to a protein classified in class 530. subclass 350. 

Group II. claims 2-4. drawn to a methcu of detection, classified in class 435. subclass 6. 

Group HI. claims 10-22 and 24-39, drawn to a method of identifying and/or classifying compounds, said 
claims are classified in class 424. subclass 2. 

The inventions listed as Groups I and II do not meet the requircmenu for Unity of Invention for the following 
reasons: The invention of Croup I is considered to form an inventive concept wherein said firat method is directed to a 
purification of Myc and ttie product ii at least one peptide capable of fonning a 02 complex and that satd peptide is 
found in a 26-29 IcD protein fraction purified from Chinese hamster ovary ceils or baculovirus. The invention of Group 
II is dnwn to different mcUiodologics, requiring different method steps and resulting in a different end product. In 
particular, the invention of Group II is directed to the detection of CI complexes. 

Additionally, the peptide of claim 5. "a protein composition comprising at least one peptide capable of 
forming a C2 complex 'is admittedly and agreeably old in the art. As can be seen on page I . bridging (o page 2 of the 
disclosure, myc has long been known m the art as well as iu sequence. Also, the prior art of record clearly 
demonstrates that myc has been expressed under recombinant conditions. Resultantly. the peptide of claims 5-9 cannot 
be considered as a special technical feature. Lacking such status, the holding of a lack of unity is justified. 

The inventiofu listed as Groups 1 and 111 do not meet the requircmenu for Unity of Invention for the following 
reasons: As shown supra, the invention of Group I is comprised of a first method of making and a product made where 
said first method is directed to the purification of Myc and the product is at least one peptide capable of fonning a C2 
complex and that said peptide is found in a 26-29 kO protein fraction purified from Chinese hamsur ovary cells or 
baculovirus. The invention of Group III is directed to '(aj method for objectively classifying compounds, including 
human pharmaceutical*, as inhibitora of c-Myc acuvity" (* 'aims 10-22) and "[aj method for identifying and classifying a 
compound as an inhibitor of c-myc hereco-otigomer DNA binding' (claims 24-39). As can be clearly seen, the 
inventions of each group are drawn to different methodol< ^ies. requiring different method steps and resulting in 
different end products. 

The inventions listed as Groups II and III do no meet the requirements for Unity of Invention for the 
following reasons: The inventions of Group II is drawn i j a method of detection of CI complexes while the mveniion 
of Group III is drawn to "(a] method for objectively classifying compounds, including human pharmaceuticals, as 
inhibitora of c-Myc activity* (claims 10>22) and '[a] method for identifying and classifying a compound as an inhibitor 
of c-myc hetero-oligomer ONA binding" (claims 24-39). Clearly said Groups arc drawn to methodologies that each 
require different method steps and result in different end products. 
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Box I Obscrrataoiu where ccrUia cUims w«rc fouod unsearchable (Cootinuatioa f item i of nrst sheet) 



This intcmationaJ report has not been otablished in respect of certain claims under Article I7(2)(i) for the following reasons: 
Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



1. I I Claims Nos.: 



2. Q Claims Nos. 



because they reUtc to parts of the intcmalional application thai do not comply with the prescribed rcquircmcnu to such 
an extent that no meaningful international search CAn be carried out, specifically: 



3- I I CUims Nos.: m . 

— because they are dependent claims and are not drafted in acconlance wiih the second and third sentences of Rule 6.4<a). 



Box II ObserTattoiis where unity of uiTentioa is lacking (Continuation of item 2 of Hrst sheet) 



This International Searching Authority found multiple inventions in this international appUcation, as foUows: 
Please See Extra Sheet. 



1 . □(] As aU required additional search fee* were timely paid by the applicant, thU international search report covers all searchable 

claims, (Telephone Practice) 

2. Q As aU searchable cUims could be searched without effort justifying an additional fee. this Authority did not inviic payment 

of any addtional fee. 

3. rn AS only some of the required additional search fees were timely paid by the appUcant, thU inumational search report wo vers 
— only those claims for which fe«» were paid. ipccificaUy claims Nos.: 



PI No required additional search fees were timely paid by the applicant. Consequently, this international search rcp.)^ ,s 
— restricted to the invention frist mentioned in the claims; it is covered by claims Nos.; 



Remark on Protest Q The additional search icc* *crv accompanied by the applicant s protest. 

|~| No protest accompanied ihc payment of additional search fees. 
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